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The present invention relates generally to * method of characterising 
a genomic DNA sample and to nucleotide sequences employed In the 
method. In particular the invention involves the use of nucleotide 
sequences comprising oligonucleotides hybridisable to regions adjacent 
informative genetic loci. The method of the invention may for example 
be used la paternity disputes, forensic medicine or in the prevention, 
diagnosis and treatment; of genetic disorders or predispositions. The 
method is of particular use where only es little as one molecule of an 
informative locus is present in the genomic DNA sample. 

1. A method of characterising a test sample of genomic DNA by 
reference to one or more controls, which method comprises amplifying 
the mini satellite sequence at at least one informative locus in the 
test sample by 

(1) hybridising the test sample with primer in respect of each 
Informative locus to be amplified, the primer being hybrldisable to a 
single strand of the test sample at region which flanks the 
minisatellite sequ nee of the informative locus to be amplified under 
conditl ns such that an extension product of the primer is synthesised 
which la complementary to and spans the said minisatellite sequence of 
the strand f the test sample; 
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(II) separating the extension pro duet to formed tram the template on 
which It was syntheslsed to yltld single stranded molecules; 

(III) If required hybridising the primer of step (1) with single 
stranded molecules obtained according to step (11) under conditions 
such that a primer extension product is syntheslsed from the template 
of at least one of the single stranded molecules obtained according to 
step (11), and 

(lv) detecting the amplification products and comparing them with one 
or more controls; 

the method being effected such that sufficient of the desired 
extension product is generated to be detectable but such that the 
yield of extension product is Inadequate to permit substantial out- of- 
register hybridisation between complementary mlnisatellite template 
strands . 

9. A polynucleotide hybridisable to a single strand of a test 
sample of genomic DNA at a region which flanks the mlnisatellite 
sequence of an informative locus so as to permit formation of an 
extension product which is complementary to and spans the said 
mlnisatellite sequence of the strand of the test sample and wherein 
the informative locus is selected from any one of the following: 
MSI, HS29, KS31A and MS31B, MS32 f HS43A and MS43B, MS51, XS228 and 
MS228B. 

10. A polynucleotide extension product prepared from a 
polynucleotide primer hybridisable to a single strand of a genomic DNA 
C9st sample at a region which flanks the mlnisatellite sequence of an 
informative locus and wherein the polynucleotide extension product is 
complementary to and spans the said mlnisatellite sequence of the 
strand of the test sample. 

13. A kit comprising polynucleotide primer for amplifying the 
mlnisatellite sequence at at least one informative locus in a test 
sample of genomic DNA the primer being hybridisable to a single strand 
of the test sample at a region which flanks the mlnisatellite sequence 
of the lnf rmative locus t be amplified under conditions such that an 
extension product of the primer Is syntheslsed which is complementary 
to and spans the said mlnisatellite sequence of the strand of the test 
sample; and the kit further comprising a control sample of DNA and 
instructions f ruse. ■ ■ :-r-^^i''^ " 
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TITLE; EXTENDED NUCLEOTIDE SEQUENCES 

The present invention relates generally to a method of characterising 
a genomic ONA sample and to nucleotide sequences employed in the 
method. In particular the invention involves the use of nucleotide 
sequences comprising oligonucleotides hybrldlsable to regions adjacent 
informative genetic loci. The method of the invention may for example 
be used in paternity disputes, forensic medicine or in the prevention, 
diagnosis and treatment of genetic disorders or predispositions. The 
method is of particular use where only as little as one molecule of an 
informative locus Is present in the genomic DNA sample. 

Methods of genetic characterisation are known in the art. In UK 
patent no. 2166445 (Lister Institute) there are described various DMA 
sequences which may be used as probes to hybridise simultaneously to a 
number of polymorphic sites within animal, e.g. human, and 
plant genomes enabling the production of a *DNA fingerprint* composed 
of marked DNA bands of differing molecular weights* The DNA 
fingerprint as a whole is characteristic of the individual concerned 
and the origin of the differing bands can be traced through the 
ancestry of the individual and can in certain cases be postulated as 
associated with certain genetic disorders. In European Patent 
Application, Publication No. 238329, there are described various DNA 
sequences which may be used as probes to hybridise individually at 
individual polymorphic sites within the animal, for example human 
genome ♦ Methods of genetic characterisation using one or more of such 
probes are described. 

Tandem-repetitive minlsatelllta regions in vertebrate DNA frequently 
show high levels of allelic variability in the number of repeat units 
[1-4]. Hybridization probes capable of detecting multiple 
minisatellites and producing individual •specific DNA fingerprints have 
been developed (5*7), as well as cloned human minisatellites which 
provide locus •specific probes for Individual hypervar labia loci (5, 8* 
10 J . These highly informative genetic markers have found widespread 
application in many areas of genetics, including linkage analysis [9, 
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11*13], determination £ kinship In for enample paternity end 
immigration disputes [6, 10, 14, 13] , monitoring bone marrow 
tranaplanti [16,17], and for individual Identification in f renslc 
medicine [10, 18, 19]. Applications to typing forensic samples such 
as blood and senen strains or hair roots are however limited by the 
sensitivity of che hybridization probes, which require at least SOng 
of relatively i<ndegraded human DNA for typing with locus-specific 
minisatellite probes [10] and 0.01* g DNA for analysis with multllocus 
DNA fingerprint probes (6), 

Where a sufficient amount of sample DNA is available in the test 
sample tho above disclosures provide valuable and reliable methods of 
genetic characterisation. However the efficiency of the above 
techniques is reduced where the amount of genomic DNA in the test 
sample is limited, for example in forensic applications where often 
only small copy numbers of the test DNA molecule are available. 

It Is therefore desirable to provide a further method of genetic 
characterisation which is especially suitable for small samples of 
genomic DNA* 

K. Kleppe ££ &1 in J. Hoi. Biol. (1971), ££, 341-361 disclose a method 
for the amplification of a desired DNA sequence. The method Involves 
denaturation of a DNA duplex to form single strands. The denaturatlon 
step is carried out in the presence of a sufficiently large excess of 
two nucleic acid primers which hybridise to regions adjacent the 
desired DNA sequence. Upon cooling two structures are obtained each 
containing the full length of the template strand appropriately 
complexed with primer. DNA polymerase and a sufficient amount of each 
required nucleoside triphosphate are added whereby two molecules of 
the original duplex are obtained. The above cycle of denature ion, 
primer addition and extension are repeated until the appropriate 
number of copies of the desired DNA sequence is obtained. It Is 
indicated that adjustment of the primer concentration may be required. 
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Thus It can be predlcttd that hybridisation will take plaea between 
amplificati n products thus resulting In networking and the premature 
termination of the PGR reaction at an annealed site. Such Incomplete 
PGR products could then act, by out of register annealing to the 
complementary mlnisatelllte strand, as a substrate for extension in 
the next cycle of amplification. This will result in the generation 
of multiple spurious amplification products. The application of PCR 
to the amplification of minisatellites , particularly long 
minisatellite alleles containing many repeat units, is thus likely to 
involve the loss of sequence fidelity and yield such a large number of 
amplification products that the results of such a process are neither 
meaningful nor accurately reflect the minisatelHte alleles in the 
starting genomic DNA. 

Whilst E. Boerwinkle and L. Chan (1988) in an abstract given out at 
the 39th Annual Meeting of the American Society of Human Genetics in 
New Orleans (October, 1988) (Abstract (0548) U.SJ refer to the use of 
PCR on tandemly repeated hypervarlabie loci, the size of the targeted 
region which they employed was relatively small, being always less 
than one kilobase in length. Moreover they do not disclose the way 
in which they applied the PCR technique to achieve faithful 
reproduction of sequences and the production of a meaningful and 
accurate result. S. Odelberg t£ Al» in a disclosure to the American 
Academy of Forensic Science (February, 1988) on the other hand have 
described their unsuccessful application of PCR to larger 
minisatellites and their findings confirm the predicted difficulties 
discussed hereinbefore since they are unable to use PCR to obtain any 
meaningful or useful result in view of the multiplicity of 
amplification products detected* 

The present invention is based on the discovery that as little as one 
molecule of an informative genetic locus in a genomic DMA sample may 
be faithfully amplified many times to yield amplification products 
which are useful for genetic characterisation purposes by effecting 
the amplification within a defined window such that sufficient of the 
desired extension product Is generated to be detectable but that the 



yield of extension pr duet is inadequate to permit substantial ut-of • 
register hybridisation between complementary tandem* repeated template 
strands. 

According to one feature of the present invention there is provided a 
method of characterising a test sample of genomic DNA by reference to 
one or more controls, which method comprises amplifying the 
minisatellite sequence at at least one Informative locus in the test 
sample by 

(i) hybridising the test sample with primer in respect of each 
informative locus to be amplified, the primer being hybridisable to 
a single strand of the test sample at a region which flanks the 
minisatellite sequence of the informative locus to be amplified under 
conditions such that an extension product of the primer is synthesised 
which is complementary to and spans the said minisatellite sequence of 
the strand of the test sample; 

(li) separating the extension product so formed from the template on 
which it was syntheslsed to yield single stranded molecules; 

(ill) if required hybridising the primer of step (i) with single 
stranded molecules obtained according to step (11) under conditions 
such that a primer extension product is syntheslsed from the template 
of at least one of the single stranded molecules obtained according to 
step (ii)» and 

(iv) detecting the amplification products and comparing them with one 
or more controls; 

the method being effected such that sufficient of the desired 
extension product is generated to be detectable but such that the 
yield of extension product is Inadequate to permit substantial out-of - 
register hybridisation between complementary minisatellite template 
strands. 



It will b« appreciated that the cycle of hybridising primer to at 
least ona of tha alngla stranded molecules obtained by separation of 
the extensl n product formed froa tha template on which It was 
syntheslsed and then further synthesis of extension product may be 
repeated as few or as many times as Is consistent with the need, on 
the one hand, to generate sufficient extension product to ba 
detectable and on the other hand to generate • yield of extension 
product Inadequate to permit substantial out-of -register hybridisation 
between complementary ainisatelllte template strands. 

Where the method of the present Invention Is effected using one 
primer In respect of each Informative locus it may be advantageous to 
subject the test sample of genomic ONA to restriction prior to 
amplification. In this regard where only one primer is used in 
respect of an informative locus, extension products of varying length 
are likely to be formed. For example a series of products may be 
formed having the same 5' terminus, but different 3' termini. A 
cleaner profile of product is obtainable if the test sample of genomic 
DNA is subjected to restriction prior to amplification. Where 
restriction is effected any appropriate restriction endonuclease may 
be employed. Since the sequence flanking the relevant informative 
locus will be known and the recognition sequence of known restriction 
enzymes is also known, the skilled man will encounter no difficulty In 
selecting an appropriate restriction enzyme for use. Where only one 
primer is used In respect of each Informative locus arithmetical 
amplification will be achieved, but where two such primers are 
employed exponential amplification may be obtained. The method of 
the present invention is thus preferably effected using two primers In 
respect of each informative locus to be amplified. 

Thus according to a further feature of the present Invention there is 
provided a method of characterising a test sample of genomic DMA by 
reference to one or more controls, which method comprises amplifying 
the ainisatelllte sequence at at least one Informative locus in the 
test sample by 



(I) hybridising the test sample vlth two primers in respect of each 
lnf rmative locus to be amplified, each primer being hybrldlsable to 
single strands of the test sample at a region which flanks the 
minisatelllte sequence of the informative locus to be amplified under 
conditions such that an extension product of each primer is 
syntheslsed which is complementary to and spans the said minisatelllte 
sequence of earh strand of the test sample whereby the extension 
product syntheslsed from one primer, when it Is separated from its 
complement, can serve as a template for synthesis of the extension 
product of the other primer; 

(II) separating the extension product so formed from the template on 
which it was syntheslsed to yield single stranded molecules; 

(ill) if required hybridising the primers of step (1) with the single 
stranded molecules obtained according to step (11) under conditions 
such that a primer extension product is syntheslsed from the template 
of each of the single stranded molecules obtained according to step 
(11); and 

(iv) detecting the amplification products and comparing them with one 
or more controls; 

the method being effected such that sufficient of the desired 
extension product is generated to be detectable but such that the 
yield of extension product is inadequate to permit substantial out-of- 
register hybridisation between complementary minisatelllte template 
strands. 

As stated above it will be appreciated that the cycle of hybridising 
primer to each of the single stranded molecules obtained by separation 
of the extension product formed from the testplate on which it was 
syntheslsed and then further synthesis of extension product, may be 
repeated as few or as many times as is consistent with the requirement 
on the one hand to generate sufficient extension product to be 
detectable and on the other hand to generate a yield of extension 



pr duct Inadequate to permit substantial out«of«r«glst r hybridisation 
batvaen c aplementary ainisatelllte taaplata stands. 



An informatlva locus includas raglons of genomic DNA by reference to 
which Individuals can ba distinguished. Hypervarlabla loci, of which 
thera may ba over 1000 in the human genome, may ba useful as 
informative loci. The distinguishing power of an informative genetic 
locus is often referred to in terms of allelic variation or 
polymorphism. Generally, the greater the degree of allelic variation 
or polymcrphisa between individuals the greater the distinguishing 
power of the locus in question. As a convenient guide informative 
genetic loci include those in which at least 3 different alleles can 
be distinguished in any sample of 100 randomly selected unrelated 
individuals. It will be appreciated that the term individual has been 
used above to refer not only to humans, but also to other animals as 
well as to plants and to cell lines derived from such humans, animals 
and plants. In each case, however the sample of randomly selected 
unrelated individuals will all be from the same species, 

In respect of the human applications of the present invention the 
expression "informative genetic locus" as used herein may 
alternatively be defined as one at which at least 3 different alleles 
can be distinguished in DNA extracted from any 20 cell lines selected 
from the following, which cell lines have been deposited with the 
American Type Culture Collection (ATCC):- 



Cell line 



ATCC Deposit No. 



Hela 



ecu 



RPHI 2650 



CCL 30 



Detroit 532 
Detroit 525 
Detroit 529 
Detroit 510 
VI-38 



Citrullinemia 



CCL 54 
CCL 65 
CCL (6 
CCL 72 
CCL 75 
CCL 76 



Cell Una 



ATCC Dap 



EB-3 


CCL 85 


RAJI 


CCL 83 


JZYOYE (P-2003) 


CCL 87 


VI- 26 


CCL 95 


Cecrolt SSI 


CCL 110 


RPMI 6666 


CCL 113 


RPMI 7666 


CCL 114 


CCRF-CEM 


CCL 119 


CCRF-SB 


CCL 120 


HT-1080 


CCL 121 


HC 261 


CCL 122 


CHP3 (M.W.) 


CCL 132 


LL47 (MaDo) 


CCL 135 


HEL 299 


CCL 137 


LL 24 


CCL 151 


HFLI 


CCL 153 


UI-1003 


CCL 154 


MRC-5 


CCL 171 


IMR-90 


CCL 186 


LS 174T 


CCL 188 


LL 86<LeSa) 


CCL 190 


LL 97A (AlMy) 


CCL 191 


HLF-* 


CCL 199 


CC0-13LU 


CCL 200 


CCD-8LU 


CCL 201 


CCD-ilLu 


CCL 202 


CCD-HBr 


CCL 203 


CCD-16Lu 


CCL 204 


CC0-18Lu 


CCL 205 


CCD-WLu 


CCL 210 


Hs888Lu 


CCL. 21 


KRC-9 


CCL 212 


paudl 


CCL 213 


CCD-25LU 


CCL 215 
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Call line 



ATCC Deposit No. 



SV403 



CCL 230 



NAMALWA 



CRL 1432 



All the above-mentioned cell lines ere freely available from the ATCC. 
12301 Parklawn Drive, Rockville, Maryland 20852-1776, USA and are 
listed in the ATCC catalogue of Cell Lines and Hybrldomas. All the 
above-mentioned cell lines were on deposit with the ATCC prior to 
1985. 

Convenient informative loci for use in the present invention include 
loci to which nucleotide sequences and probes disclosed in European 
patent application no. 238329 are capable of hybridisation, as veil as 
those informative loci the flanking sequences of which are disclosed 
in this application. Further informative loci include genomic DNA 
regions identified by minisatellite probes disclosed by 5 J Gendler &£ 
Al. in PNAS, £4, (1987), pages 6060-6064; A particularly informative 
locus is the 5' alpha globin HVR disclosed by the American Journal of 
Human Genetics, 1988, 41, pages 249-256. 

Convenient informative loci for use in the present invent lor^Lnclude 
those of up to 15 kilobases. More convenient informative loci include 
those which provide short alleles, the upper limit being for example 
of less than 10 kilobases particularly of less than 8, more 
particularly less than 6 kilobases. Conveniently the alleles are of 
at least 1, particularly at least 1.5, more particularly at least 2 
kilobases. Thus suitable ranges include 1,1.5, 2, 2.5, or 3, to 6, 
7, 8, 9 or 10 kilobases. In respect of the performance of the method 
of the invention starting with only a single sample DNA molecule it is 
preferred that the length of the alleles does not exceed 6 kilobases. 
Also preferred are informative loci with a restricted range of allele 
length. It has been observed that these preferences are not always 
compatible with highly informative loci since these are usually 
associated with large p abers of minisatellite repeat units and long 
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alleles. The skilled men will however have no difficulty In selecting 
convenient loci for his purpose. 

Primers capable of hybridisation to regions f sample DMA flanking 
Informative loci are required to provide points for the initiation of 
synthesis of extension products across informative genetic loci. Such 
primers are generally oligonucleotides and are preferably single 
stranded for maximum efficiency in amplification, but may 
alternatively be double stranded. If double stranded, the primer is 
first treated to separate it* strands before being used to prepare 
extension polynucleotides. Primers must be sufficiently long to prime 
the synthesis of extension polynucleotides. The exact length of such 
primers will depend on many factors including temperature and the 
nature of the primer. Generally a primer will comprise at least seven 
nucleotides, such as 15-25 nucleotides, for example 20*25 nucleotides. 
It will be appreciated that the flanking sequence need not reflect the 
exact sequence of the flanking region of the informative locus. It Is 
merely necessary that the respective sequences are homologous to a 
degree enabling hybridisation. The extension product so obtained must 
also be capable of acting as a template for further hybridisation with 
a primer . The maximum length of any primer is not believed to be 
critical and is only limited by practical considerations. 

Normally the test sample DNA will be double stranded and it is 
therefore convenient to select primers which hybridise to different 
strands of the sample OKA and at relative positions along the sequence 
such that an extension product synches is ed from one primer can act as 
a template for extension of the other primer into a nucleic acid of 
defined length. 

The flanking regions of certain of the informative genetic regions 
identified by the probes disclosed in UK Patent No, 2166445 and 
European patent application no. 238329 referred to above and 
nucleotide sequences hybridisable thereto have not been previously 
disclosed and both individually and in any combination represent 
further aspects of the present invention. The sequences are set out 
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on pagea preceding the Examples* The informative genetic regi ns are 
indicated n some sequences by the v rd MINI SATELLITE. As explained 
above any convenient sequence nay be us H as a primer. Examples of 
some convenient primer sequences have been underlined. For clarity 
and brevity, on some sequences only the outermost repeat unit on each 
side of each mlnisatellite tandem array is shown (in capitals), 
separated by a series of x's to indicate the omission of most repeats. 
Since there is seldom a whole number of repeat units in each array, 
these outermost repeats may be out of register with each other, but 
preserve the correct relation to the immediate flanking DNA. 

A previously unsuspected tandem repeat region was found by sequencing 
pMS31(EP- 238329)^ this array is separated from the major mlnisatellite 
by 10 bases, and consists of more than 7 repeats of a 19bp G/C rich 
sequence which extends to the &au3AX site defining the end of the 
clone. This region, designated 3 IB, is C*rich on the opposite strand 
to the C-rich strand of the major mlnisatellite. Genomic mapping 
shows that, if variable at all, 31B makes a minimal contribution to 
the length variation at this locus (data not shown). 

Two ioci were detected by pMS228 (Armour a1, 1989, KAR, 12, 13, 
4925-4935) at high stringency. Family analyses showed these loci to 
be tightly linked. Restriction mapping of the cloned DNA, 
subsequently confirmed by sequohce analysis, demonstrated the presence 
of two distinct tandem repetitive regions, designated 228A and 228B 
(Figure 8) . Re probing human DFA with subclones from pHS228 showed 
that 228A detected the larger » more Intensely hybridising locus, and 
228ft the smaller, more faintly hybridising alleles. These 
mlnlstallltas were localised, using somatic cell hybrids and in situ 
hybridisation, to 17pl3-pter. The properties of the loci detected by 
pMS51 and pMS228, and by other minisatellites are summarised In Table 
2. 

Previous evidence for the disposition of a pair of minisatellites in 
pMS43 as descrobed in our UK patent' application 8813781.5 was 
confirmed by sequence data, and two further examples (pMS31 and 
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pMS228) of minisatsllire clones containing two cl sely adjacent tandem 
repeat arrays were elucidated. Unlike pMS43, In which the 'minor' 
locus 43B has & low (30Z) heterozygosity, both the mlnlsatellltes In 
pMS228 have heterozygosities greater than SOX. The oinlsatallite 228B 
(figure 8, table 2) combines a high heterozygosity (85X) with a 
limited allele size range (0.6 «5.Skb); this combination makes It a 
very useful locus for analysis of mlnlsatellltes by the polymerase 
chain reaction (23). In general, the most variable loci have a wide 
range of allele sizes such that many alleles will exceed the maximum 
size currently ampllflable by thU method, so giving an incomplete 
profile. At the locus detected by 228B, in contrast, a survey of 48 
unrelated people shoved that 95X of alleles were smaller than 2kb, and 
that the largest (5.5kb) was well within the ampllflable range (J 
Armour and A Jeffreys, unpublished). Ve have shown that even the 
largest allele at this locus can be amplified, providing the prospect 
of a complete and yet usefully Informative profile from amplification 
at a single locus. 

A novel cloned minisatelllte termed MS29 has been isolated which 
unusually detects two variable loci In human OKA. One locus, located 
In the terminal region of the short arm of human chromosome 6, Is also 
present in great apes. The second minisatelllte locus Is located 
Inters tit ially on chromosome 16pll, and is absent both from non- human 
primates and from some humans. KS29 was Isolated from a L47 genomic 
library made from OKA enriched for minisatelllte sequences as 
described elsewhere (Wong ££ Ann. Hum. Genet. (1987) £1, 260*288. 
KS29 comprises a 39bp repeat sequence. This consists of 13 diverged 
repeats of the trinucleotide YAG wherein Y represents any one of A, C, 
C or T. The flanking sequences of this minisatelllte can be 
elucidated using sequencing techniques known a$X u,. 

In summary the relevant novel flanking sequences are those of MSI, 
MS29, MS31A And KS31B, MS32, MS43A and KS43B, MSS1. MS228A and M5228B. 
In addition we now also provide further novel flanking sequence 
information for the minisatelllte probe p g3 (Vong fit *1, Nucleic 
Acids Research. l£ t 11, (1986), 4605-4616). as veil as for the probes 
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33.1, 33.4 and 33.6 (UK Patent 2166445/Tho Lister Institute of 
Preventive Medicine). 

A particular group of flanking sequences are those of the 
mlnlsatelllte probes KS1, KS29, KS31A and KS31B, KS32, KS43A and 
MS43B. KS51, MS228A and KS228B. 

Further particular groups of flanking sequences are comprised by chose 
of any of the above mlnlsatelllte probes. 

As stated previously the mlnlsatellltes MS29 and MS 3 IB are novel and 
the repeat sequences and/or flunking sequences of either of these 
mlnlsatellltes represent further particular aspects of the Invention. 

The method of the present Invention can be used In respect of any 
convenient Informative locus. Thus, for example primers may be 
prepared for hybridisation to the flanking sequences of for example 
the hypervarlable region 5' to the human insulin gene (Am* J, Hum Genet, 
1986, 22. 291-229), of the 5' alpha globin HVR (Am. J. Hum. Cenet, 1988, 

249-256). of the alpha globin 3 f HVR (EMBCy , 1986, £, 1957-1863), 
of the hypervarlable region at the zeta globin locus (PNAS, 1983, ££, 
5022-5026), or of the Ha Ras locus (Nature. 1983,222, 33-37). Also 
Incorporated by reference In this application are a panel of probes 
proposed by Ray White. The flanking sequences of these may be 
elucidated using known techniques. 

The extended flanking polynucleotides prepared according to the method 
of the invention represent a further aspect of the Invention. These 
polynucleotides may be present in single or multiple copies. Where 
multiple copies are present these are faithful copies and preferably 
substantially free from networking or cross linkage between individual 
strands. By the term ftlthful we mean that thn genetic 
characterisation Information to be obtained from the size and 
composition of the copy Is essentially the same for all copies. 
Conveniently the extended primers comprise a sequence Identical with 
or complementary to an informative genetic locus of more than 1 
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kllobase. Other convenient values include 1.5 and 2 kll bases. 
Conveniently the number £ eoples represent the pr duets f at least 
3, 5, 7, 9, 13 or IS cycles of the cmpllficatl n method of the present 
invention. 

According to a still further aspect of the Invention there is provided 
a mixture containing multiple faithful copies of extended primers (as 
hereinbefore defined). As above the mixture is substantially free 
from networking or cross* linkage between individual strands. 

The primers for use in the present invention may be prepared by 
methods analogous to those known in the art. For example where a 
given flanking sequence is known, a convenient nucleotide primer may 
be prepared by direct synthesis. Cloning techniques may also be used 
to reproduce DNA fragments containing sequonces flanking informative 
genetic loci. 

Alternatively DNA fragments comprising informative genetic loci may be 
Identified and using a procedure analogous to that of the methods 
outlined above, a nucleotide sequence hybrldisable to the informative 
genetic locus may be extended. The product so obtained may then be 
directly modified, for example by cleavage, to prepare a convenient 
primer or the sequence thereof may be determined and convenient 
primers then prepared by direct synthesis. 

As explained above the sample DNA is generally double stranded. It is 
therefore desirable to separate the strands of the nucleic acid before 
it can act as a template for extension of a primer. Strand 
separation can be accomplished by any suitable method including 
physical, chemical or enzymatic means. Conveniently heat denaturatlon 
Is used to provide up to about 99X denaturatlon. Typical temperatures 
used include 85-105°C for times ranging from about 1*10 minutes. 

A primer Is preferably used for each unique strand of the sample DNA. 
In respect of double stranded DNA two primers are therefore normally 
used. Generally the primers are chosen to allow extension to proceed 
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across the informative locus In the same direction along both strands, 
that la t aay 5* to 3* or vica versa, Prefarably extension pr caeds 
5' to 3'. 

Primers are conveniently hybridised with the genomic DMA sample under 
known conditions. Generally this is allowed to take place in a 
buffered aqueous solution, preferably at a pH of 7*9. Preferably a 
large excess of primer la present in the reaction mixture. It will be 
appreciated that in certain diagnostic applicationa the amount of 
sample DNA may not be known but an excess of primers is desirable. 

The amplification la conveniently effected by adding the 
deoxyribonucleoside triphosphates dATP, dCTP, dCTP, and dTTP to the 
synthesis mixture in adequate amounts and the resulting solution may 
then, for example be heated to for example about 90°*100°C for a 
period of for example from about 1 to 10 minutes. After this heating 
period the solution may be allowed to cool to 30«70*C which is 
preferable for the primer hybridisation. Uhere Taq polymerase is 
used as the inducing agent to facilitate the extension reaction it is 
preferable to effect primer hybridisation at a temperature of from 50- 
70'C for example 50°-65°C ( such as 60°C, 

It has been found that primer hybridisation is advantageously effected 
in a buffer of reduced ionic strength and at an elevated annealing 
temperature. This reduces the possibility of misprinting. The 
expressions "reduced ionic strength* and "elevated annealing 
temperature" are to be understood as referring to conditions at the 
extremes of, cr outside those ranges which would generally be 
considered appropriate by the molecular biologist of ordinary skill 
for a given nucleic acid amplification reaction. By way of 
illustration but not limitation a convenient buffer will be of 
sufficient ionic strength to enable a stable pH of 7*9 to be 
(maintained. Convenient annealing temperatures are generally S0*60°C. 
It will be, appreciated that the actual reaction conditions will depend 
on the primer(s) used. 



KJ - KJ 
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An appropriate agent f r Inducing r catalysing the extension reaction 
may then be added c the annealed mixture and the reactl n may then b 
allowed to proceed under conditions known In the art. 

The Inducing agent la conveniently an enzyme. Suitable enzymes 
include the Klenow fragment of E. coll DNA polymerase X, T4 DNA 
polymerase, but particularly Taq polymerase. Other available DNA 
polymerases, reverse transcriptase and other enzymes, Including heat 
stable enzymes which will facilitate the extension reaction may also 
be used. 

The newly syntheslsed strand and Its complementary nucleic acid strand 
form a double* stranded molecule which Is used In the succeeding steps 
of the method. The steps of strand separation and sequence extension 
may then be repeated as required using the procedures outlined above. 
Suitable conditions for such procedures are outlined In US patent nos. 
4683202 and 4683195 Incorporated by reference above. As explained in 
the above patent specifications the separation and extension cycles 
may be performed stepwise or advantageously a plurality of cycles are 
operated for example in a semi -automated or fully automated manner. 

In the conventional PGR technique the amplification reaction is 
generally allowed to proceed for up to about SO cycles. In contrast, 
acccordlng to the method of the present invention the upper limit for 
amplification is believed to be about 25 cycles when starting from a 
single copy of the sample DNA. Vhere larger starting amounts of 
genomic DNA sample are available fewer cycles of amplification are 
required. The convenient window of amplification cycles depending on 
the amount of sample DNA and the nature of the sequence will now be 
illustrated, but in no way limited, by reference to the following 
analysis: 

The following analysis refers to amplification reactions 
using 10^1 volumes and, with extension times of 15 minutes: 
Considering an individual heterozygous for alleles A And B, where A is 
1 nger than B It can be predicted that A will therefore amplify less 
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efficiently than 1. The lover limit o£ cycle number la dictated by 
the aenaltlvlty of the probee, Which can detect about O.i pg pc duct. 
T detect allele A, eufflclent cyclea aro needed to gtnarato >0.X pg 
allele A product. Similarly. >0.1 pg allela B must b« produced to 
detect both alleles. The upper limit of cycla number C u la limited 
by the yield of both allele* A and B. 

Ci and C u can be estimated aa follows: 
Let M - Initial mass of human genomic DNA (In plcograos). 
a - length of allele A (kb) 
b - length of allele B (kb) 
ga - gain per PCR cycle of allele A 
gb - gain per PCR cycle of allele B 
Since the diploid genome size of man la €xl0 6 kb then the Initial 
amount of allele A Is 

«• » Pg 

6 x 10 6 

Similarly, the Intlal amount of allele B Is 

M. b pg 

6 x 10 6 

The yield of allele A after C cycles Is given by: 

H. ■ g» c Pg 

6 x 10 6 

and similarly for allela B. 

From Fig. 3 the gain per cycla g la related to allele length 

L by 



gL - 2 • 0.093 L (L - 0«6 Vb) 



• • * « 
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Th« lover llmic f cycles is therefore tec by the 1 west 
value of C where 

Me (2 • 0.093.L) C > 0.1 

6 x 10 6 

end Mb 12 - 0.093.L) C > 0.1 

6 x 10 6 

The upper limit C u is likewise set by the highest value of C 
where the total yield of both alleles is < 1000 pg. I.e. 

M.a (2 • 0.093.L) C ♦ M b (2 • 0.093.L) C <1000 
6xl0 6 6 x 10 6 

and C u can be determined by computer reiteration. Some 
typical examples art: 







fe- 


fil 


Cu 


pg 


kb 


tch 






10,000 


S 


1 


7 


20 


1.000 


3 


2 


10 


24 


100 


6 


0.5 


19 


27 


6 


6 


1 


27+ 


32* 



+ not* that the window becomes very narrow and can disappear for 
very small amounts of DMA with widely differing allele sizes. 

* H.B. This model does not take into account the fact that 
significant stochastic variation in the number of target molecules 
exists in amounts of human genomic ONA <100pg (equivalent to 17 
cells). The analysis for 6 pg DMA corresponds to the amplification 
of a single target molecule of each allele. 

Homosygotes can be readily accomodated In the model, C u 
remains unaltered, whereas is defined by the number of cycl s 
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required to give >0.05 pg of tgfih allele (i.e. >0.l pg combined). 

Reaction* with different volumes can alao be accomodated. 
0} la defined by the number of cyclea required to give X),l pg of each 
allele in the total reaction. C u la defined by the number of cycles 
required to give <1000 pg total product per lOjfl of reaction. For 
large volume PCR reaction, will be unchanged but C u will be 
somewhat larger than with a 10^L reaction. 

The window of amplification cycles which generates an appropriate 
amount of amplification product for characterisation (0.1-4000 pg of 
product) is for example 10*15 cycles for 100 ng of genomic DNA, 18 
cycles for lng and 25 cycles for single cell amplification (6pg), The 
number of PCR cycles may need to be increased to detect larger alleles 
which amplify less efficiently. Depending on allele length 1000-10 6 pg 
of faithfully amplified product can be obtained. 

A particular advantage of the claimed method is that differences of as 
little as one repeat unit in an informative region may be identified. 
The method of the present invention la also believed to offer faithful 
reproduction of large Informative genetic loci, for example those of 
up to 15kb 9 such as up to about lOkb. 

It has also been found that the problems associated with the 
networking of amplified copies of sample DNA as well as the generation 
of incomplete extension products can lead to the appearance of 
significant amounts of single stranded mlnisatellite DNA of 
heterogeneous size. This single stranded DNA can produce significant 
levels of aspeclflc products in the PCR reaction. This problem may 
be overcome or at least alleviated by the use of enzymes which 
specifically digest or degrade single stranded DNA and which leave 
double stranded DNA intact. The preferred enzyme is SI nuclease. 
Digestion of the final PGR products with such an enzyme reaulta in a 
cleaner profile of PCR products revealed at the final detection step* 
Therefore in a preferred aspect of the present invention the method of 
the invention includes the use of at least one enzyme which 
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specifically digests or degrades single stranded ONA whilst leaving 
double stranded DRA intact, vhereby to ameliorate the problems of 
aspeclfic products of the PGR reaction. 

Detection of the amplified products may be effected by any convenient 
means. The amplified products may be identified and characterised, 
for example using gel electrophoresis and followed if desired by 
hybridisation with probes hybrldlsable to the informative loci 
contained therein followed by for example autoradiography where 
radiolabeled probes are used. Convenient procedures are disclosed in 
European Patent Application, Publication No. 238329. Alternatively 
more direct methods may be used. Separation of the amplification 
products on a gel may be followed by direct visualisation thereof, 
nirecc staining with for example ethidium bromide may be effected 
where a sufficient quantity of product is available. The flanking 
nucleotide sequences may alternatively carry a label or marker 
component and such label or marker may then be detected using any 
convenient method. Such labels or markers may include either 
radioactive and non radioactive components but the latter are 
preferred. 

The number of informative regions which can be amplified 
simultaneously is not believed to be limited, other than by practical 
limitations. For example up to twenty regions could be amplified 
simultaneously, conveniently ten regions and in particular up to 
eight, seven, six, five or four regions. In a preferred aspect six 
regions are amplified simultaneously. 

The flanking polynucleotides of the present invention may conveniently 
be provided in a diagnostic kit, A further aspect of the invention 
therefore relates to a kit comprising two complementary flanking 
polynucleotides capable of hybridisation to sample DHA regions 
adjacent an informative locus together with a control sample of DNA 
and instructions for their use. Examples of convenient and preferred 
probes include those outlined elsewhere in the specification. The kit 
may also include rea-ent nucleotides for extension of the flanking 
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polynucle tides acr as the informative genetic 1 cue and/or extensi n 
promotor such as an enzyme 

Examples of flanking sequences ta informative genetic regions which 
may be used in the method of the present invention are set out on the 
following pages. Also disclosed are details of a panel of single 
locus probes which are capable of hybridisation to informative genetic 
regions. This panel of probes was disclosed by Ray White at a 
meeting in Quant ico, Virginia in May 1988. Polynucleotide primers 
capable of hybridisation to regions adjacent to such informative 
genetic regions may be prepared as previously outlined and used in the 
method of the invention. Primers may be prepared in respect of any 
one such region or combination of regions to provide extension 
products of any desired panel of informative reglon(s). 
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pMSl.lb 5* flanking sequence extending into alnlsatelllte. 



5 • CCCATTTCCATAAACACCTATCCACTACCTAATACTACAAACTACCATACCACCTACTAC 
1 1 1 1 1 1 

3 • CCCTAMCCTATTTCTCCATACCTCATCCATTATCATCTTTCATGCTATCCTCCATCATC 

ATCCATTAACAATCTTAATAATTAATCACCCTCATTTCCCATTGATTTTAACTTTCCATT 

I t II > I 

TAGGTAATTCTTACAATTATTAATTAGTCCGACTAAACCCTAACTAAAATTCAAACGTAA 

TTATTACTCTCACCTCGTTTTACCAATCCCCrCCCTTTCCTTCATCCTTACACrAAAAAA 
1 1 1 | | 1 

AATAATC ACACTC CACCAAAATG CTTA CCGGAGG CAAACC AACTACC A ATCTC ATTTTTT 

ACMCAACACTAATGATTCTATCTC^TCCTTTtCTCTCATCAGCCTTGATCITTTTAA^ 

1 1 > 1 "\ 1 

TCTTCTTCTCATTACTAACATACACTACGAAAAGACACTACTCCCAACTACAAAAATTCT 

GAATTTGATAGTTTGAGACATAAAAATITrTGAAAAACCAACTCACGTTTCCAAACA 

I 1 \ \ \ \ 

CTTAAACTATC AAACTCTC TATTTTTAAAAA CTTTTTG CTTCA CTG CAAAG CTTTCTTTT 

CTGAAAACCAGAGCtCGATTCCACCCCCCCCACCCCAGCAAATTCACAAATCACCCCrTG 
1 ■— | ,| I I — — I 

CACTTTTCCTCTCCACCTAAGGTCCGGCCGGTCGGCTCGTTTAACTCTTTACTCGCGAAC 



CTCTGACTCACTGCCT 3 f 

1 MINI SATELLITE 

GACACTCAGTCACCCA 5* 
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pMSl.lb 3* flanking sequence, starting at the Eco Rv site about SObp 
from the end of clnlsatel lite 



EcoRV 
I 

5 1 TCCATATCTCATCCATGCGAGCCCACATTGGCTCCTCCTCGCCCTCATCCCTTGCATATC 

I I \ ■ -I > 1 

3 ■ ACCTATAC^STACCTACCCTCCCCTCTAACCCACCACCACCCCGAGTACCCAACGTATAC 

CTTCTTTGCTCCCTCTCCTGTAACAACCATCCTAACAACACACCTGTGCCCCTGCCCTCT 

1 1 1 1 H 1 

CAAGAAACTACGGACACGACATTCTTCGTACCATTCrTGTCTGCACACCCGGACCCCACA 

CACCACTCCATCTGCCAGCGACCTCTCCCTCTCGAAGCACCAGCAAAAACAGCTGTGCCA 
— 1 *\< 1 1 1 1 

CTCCTC^CXTACACCCTCCCTCCACACCCACACCTTCGTCGTCCrTTTTGTCCAaCCCT 

CGTCCCCCAGCCCAACCCACACACAACCACCCCCAGGCAAACAGTTCCAAGCTCCTTCCC 
1 1 1 1 1 1 

CCAGCCCCTCCCCTTCCCTCTCTCmCTCCCCCTCCCTTTCTCAACGTTCCACCAACCC 

CACCCKACCATCTCCTCATGGCCTCCTCCCCCCCTGTCACTTTCCCTATCTGTAAAATGA 3 • 

1 1 — 1 1 1 1 

CTCCCNTCCTACACCACTACCGCACGACCCCCCCACACTCAAACCGATACACATTTTACT 5 1 
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The mlnisatellite repeat sequence In clone XHS29. The region 
In the 39bp consensus repeat sequence similar to the 3' end of the 
mlnisatellite core sequence (CCACGTCGCCACGARG, Jeffreys et al t . 1985 ) is 
marked with asterisks* Deviations from the consensus sequence are shown 
for two blocks of 4 contiguous repeat units (a-d f e-h) cloned at random 
f i jrn XMS29. The 39bp repeat sequence consists of 13 diverged repeats of 
PyAG. 



consensus TTGCAGTAGCTGTGGCAGGAGGAGTAGCAGCATCAGCAG - ( YAG ) 2 3 
■ > >> > >»>»»» » > — ► 



• ■■■■ v^/ 
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PHS31 5* flanking sequence extending Into nlnlsatelllte 



5 • GATGGACTGGGAACCACCTGCAGTTAGCAGCAACCGTACAATGTTCTGCAAGGATTGAAG 
1 1 1 1 1 1 

3 1 CTACGTCACCCTTCGTCGACGTCAATCCTCGTTCGGATCTTACAAGACCTTXICTAACTTC 

CCACCCTTCTCTCAGCCCCTCGGAAACTCCCCTGCACATGCGCATCTGGCTCCACCACCC 

1 | I I 1 1 

CGTCGGAACACACTCCCCGACCCTTTCACCCCACCTCTACCCCTACACCGACCTCCTCCG 

GAGGAACATCCrGAAGTCCTCTCTCAGCCCCCCTCTCCGAGCCACCCCCCCTCCCCCACT 
1 1 1 1 1 1 

CTCCTTCTACCACITCACCACACACTCCGCCCCACACCCTCCCTGCCCCCGAGCGGCTGA 

CACTCCGCCCTCCTCCCCmCCTGCCCGCCCTCCCTCAGACCCCACGGCTCCCCACCTC 

1 1 1 1 —I 1 

CTCACCCCCCACCACCCCAAACCACCCCCCGGAGCCACTCTCCCCTCCCGACCGGTCCAC 

CCTCTCCCCCGCGAGCCACAGCCACAACCrAGGCACGCCAACCCCCATCCACAATGTTGG 
1 1 1 1 1 1 

CCAGACCGCGCCCTCCCTCTCCGTCTTCCATCCGTCCCCrTCCGCCTACCTCTTACAACC 

CTCTTCCCTTOCACCCTCCACCCTCCOCTTTTGCCTTTCGCCTTCCCCTCACCAGCGCC 
1 1 1 1 1 1 

CACAACGGAAACCTCCCACCTGCCACCCCAAAACGGAAACCCGAACCCCACTCCTCCCCC 



TCCCGGCTCACCACCCCCCATCAACCCCACCrCCCCrTCC 3 • 

1- 1 1 1 MINI SATELLITE 

ACCCCCCACTCCTCCCCCCTACTTCCCCTCGACCCGAACC 5 • 
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FLANKING SEQUENCE V TO THE HS31 LOCUS 
mNI SATELLITE 

HINISATELLITE 5 1 CCCCCCCC ATCCCCCTCT CCCCACCCTG TCCCCCTCTC CCCACCGCCT 
3« CCCCCCCC TACCCCCACA CCCCTCCCAC ACCCCCACAC CCCTCCCCCA 

CCACCTCTCC CCACCCCCTC CACCTCTCCG CACCCCCTCC ACCTCTCCCC ACCCCCTCCA 
CCTCCACACC CCTCCCCCAC CTCCACACCC CTCCCCCACC TCCACACCCC TCCCCCACCT 

CCTCTCCCCA CCCCCTCCAC CTCTCCCCAC CCCCTCCTCT CCCCATC 3 f 
CCACACCCCT CCCCCACCTC CACACCCCTC CCCCACCACA CCCCTAC 5 f 
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5* flanking <egucnce of pHS32,6 extending Into mlnlsatclllte 



5 • GATCACCGGTGAATTCCACAGACACTTAAAACCAAAAATAATAATTTCTTGAATACAGTC 
1 1 1 1 1 1 

3 « CTACTCCCCAOTTAAGGTCTCTCTCAATTTTCCTTTTTATTATTAAACAACTTATGTCAC 

ACTTCTAAATTTCTCTTCAAACAATCACTATCICACTATGTTCACTTCTTTGTTCTCCAT 
1 1 1 1 1 1 

TCAACATTTAAACACAACTTTCTTAGTCATACACTCATACCAGTCAACAAACCCGACGTA 

TTTAAAGTTGAACTTCCTCGTTCTCCTCAGCCCTAGTTTCACTAAACAACCTTTTCCAAC 3 • 

1 1 1 1 I 1 

AAATTTCAACTTG AACCACCAACACCACTCCCC ATCAAAGTCATTTCTTCC AAAAG GTTG 5 1 

MIKISATELLITE 
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Flanking sequencet of 33, A 



S 9 ATCAACCACC CACTCACACA TCAACTCCAA CTCACCAGCG CCTCCCACCA AACCCCCTCC 
3 1 TAGTTCCTCC CTCACTCTCT ACTTCACCTT CACTCGTCCC CCACCCTCCT TTCCCCCACC 

ACCCACAAuA TCACTTTCAC CTACTATACA CCACAAAACC AAAACCCAAG ATAAAAATCC 
TCCCTCTTCT ACTCAAAGTG CATCATATCI CCTCTTTTCC TTTTCCCTTC TATTTTTACC 

CTCGCGACCG GCACCGATCC CCGA CCCGGC CACACCCCAO CTGCTGACC A CCCGCCACCT 
CACCCCTCCC CGTCCCTACC CCCTGCCCCG CTCTGCGGTC GACCACTCGT GCCCCCTGCA 

MINISATELLXTE 

CCTAACCAGG TTCTCCTCCC CTCCTGCCTC CCTCTAGCCA CACCCTGCCT CCCCCAGGCC 
GGATTCGTCC AAGACCACCG GAGGACCCAC CCACATCCGT CTCCCACCCA GGGCCTCCGG 

CCAGCCCGCC TCTCACAACC CCCCGCCCTC AGCCCCACCC CACCACACCC CCTATGCTGC 
CCTCCCCCCG ACACTCTTCC CCCGCCCGAC TCCCC CTCGC CTCCTCTCGG GGATACGACG 

CCCCGTCCAG CCCCCCCCCT CACCCTCTCT TTCCTCAGAC CTCCCTAGTC TAACCCAAAC 
CCCCCACCTC CCGCCCCCCA CTCCCACACA AACCACTCTC CACCCATCAC ATTCCCTTTC 

CTCCCACCTC TGTCATTCAA ACCCATGCAT TACCTCCCCC TCCCTCCATT TCCCT 3' 
CACCCTCGAG ACACTAAGTT TCCCTACCTA ATCCACGCCC ACCCACCTAA AGCGA 5' 



• • • • 
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Planking sequences of 33.6 



TCTCACTACA CCACACCTCA CATTT CACCT TCGAAACT 
ACACTCATCT CCTCTCGAGT CTAAACTCCA ACCTTTCA 

MINI SATELLITE 

CGTTC CTCCTCACTC TGTCGTCTTT CTCTGTCCAG ACCTTCCCTT 
CCAAC CAGGAGTCAG ACACCACAAA CACACACCTC TGGAACGCAA 

CTTCCATTCA CTAATCTCTC ACCTGCTTGC 3 f 
GAACCTAACT CATTACACAC TGCACGAAOC 5' 
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FLANKING SEQUENCES 07 THE KS51 LOCUS 

5* CATCACCCAA CTTCCTCTCC CCTCCC CATA TCCTCCTCCA TACCCACTCT CCCACAACCC 
3« CTACTCCCTT CAACCACACC CCACCCCTAT ACGACCACCT ATCCCTCACA CGCTCTTCCC 

CCAGCCTCCC TTTCAGCCTC TCATCCACAG TCAACCCCAC TTCACGCTTT CTTACCCGAC 
CGTCCCACGG AAAGTCGCAG ACTACGTCTC ACTTCCCCTC AACTCCCAAA CAATCCCCTC 

CGGCTGCACC CAC 
CCCCACCTCC CTG 

MINISATELLITE 

CCTC CCTCCCCTTT CCCCATCCTA CCCCCTGCAT CCCAGTCTCG 
CCAC CCACCCCAAA CGCCTACCAT GCCCCACCTA GCCTCACACC 

TCTTAAGCAC ATCTCGACAC CCTCTCTCCG ACACACATAC TCCCCACCCG ACCTACAGCC 
ACAATTCGTC TACACCTCTC CGAGACACGC TCTGTGTATC ACGGGTCCCC TGGATGTCGC 

ACACCCTCAC CTCCTCAAAA TTTCCCAGCT TCCCACAGTC CTCAATCTCC AAACCACTCC 
TCTCCCACTC GAGCACTTTT AAACGGTCCA AGGGTGTCAG CAGTTACACC TTTCGTCACG 



CCAAAGGCCA CCTCCCCACA CTGCCACCGA ATTC 3* 
CCTTTCCCCT CGACCCCTCT CACCCTCCCT TAAC 5* 
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HS51 - Sequence across the adjacent Hinf I site 

1 AGTGGATTTT CCCACTTCCC TCTGTTGTCT ATAATGAATG ACCACTACAT 

51 TTTTAACCCC AAAAGCTTCT CATATGAAGA GGAGACTTCT TTCAGCAGCA 

101 GGCAGGCCAC CTTCTCTCAA GCTCGCCACC AACCACCAAC CCACTCCCTG 

151 GCCACGGCAG CTGACGCCCA GGCAGTGACC CCACACGGGA GCGCTCTCGA 

1 201 AAAGRCTGGA GCTCCCATGA C 
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flauklng sequences of the hypervarlable regton 5' 
to the human Insulin gene 
(AM. J. HUM. CENET., 1986, 39, 291-229.) 



5 ( CTGCCCCTCC TCTCCTAACG CACGCTCCCA ACTACCCAGC CAGCACCCAC CCCACCCCTC 
3' GACCCCGACG ACACGATTCC GTCCCACCCT TGATCCCTCC CTCGTCCCTC CCCTCuGCAG 

CCTCACTCCC ACTCTCCCAC CCCCACCACC TTCCCCCATC CATGGCGGCA TCTTCGCCCA 
CCACTGACCC TCAGACGGTG GCGCTCCTCG AACCGCGTAG CTACCCCCCT AGAACCCGGT 

TCCGCCACTG CCG HINISATELLITE AC AG 

AGCCCCTCAC CCC TGTC 

CAGCCCAAAG ACCCCCCCCC TCCACCCTCC ACCTCTCCTG CTCTAATCTC GAAAGTGCCC 
GTCCCCTTTC TCCGCGCCCC ACCTCCCACC TCCACACCAC CAGAT7ACAC CTTTCACCGG 

CACCTCACCC CTTTCCTCTC CTCCACACAT TTGCCCCCAC 3' 
CTCCACTCCC CAAACCAGAC CACCTCTCTA AACGCCCGTC 5 1 
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FLANKINC SEQUESCES OF 5' ALPM GLOBIN HVR 
(AM. J. HUM. GENET. . 1988. «3, 249-256) 



SEEKS — 3KSSSS5SSES 



KJ [J 
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FLANCING SEQUENCES OF TEE ALPHA CLOBIN 3* HVR 
(EMBO. 1986, 5., 1957-1863) 



ACTCCCACCT CCACCAAAAC CTCCACCTAA CC 
TCACCCTCCA CCTCCTTTTC CACCTCCATT CC 

MINISATELLITE 



ACCAACACCA ACACCCAGGC AATATCGACA TCCCTGATCC 
TCCTTCTCCT TCICCGTCCC TTATACCTCT ACCCACTACG 



CTCCAAACCA CAGCATCAAT CCCAACTCAC TCA 3* 
CACCTTTCCT CTCCTACTTA CCCTTCACTC ACT 3' 
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Flanking sequences cf hypervariable regi o n at the Zeta globln locus 
(PNAS i 1983, 80, 5022-5026) 



5 f ACACCCATCA A7CCCACCAC CACCACACACAT CCACCCTAAT CTCATCTTCT ACACACCAT 
3* TCTCCCTACT TACCCTCCTC CTCCTCTCTGTA CCTCCCATTA CACTACAACA TCTCTCCTA 

CGTCCTCACC TCACCACACC CACATTATTA CAAAATAACA CCACACCCTT CCCCTCCACC 
CCACCACTCC ACTCCTCTCC CTCTAATAAT CTTTTATTCT CCTCTCCCAA CCCCACCTCC 

CCCCACACAA CACTACCCAC AACCAGAAAG AAACCTCAAA ACCTGTTCCT GCAACCAACC 
CCCCTGTGTT CTCATCCCTC TTCCTCTTTC TTTCCACTTT TCCACAACCA CCTTCCTTCC 



TCTTCCTATT TTCAACGCCT 
AGAACCATAA AACTTGCCCA 



HI NX SATELLITE 



ACC 
TCC 



TACACGCACA AAACACTTCC TCCTCTCCCC CTCCCTTCCC CCTGGTGGTA CACCCCTTAT 
ATCTCCCTCT TTTCTCAACC ACGACACCCG CACCCAACCC CCACCACCAT GTCCGGAATA 

CTCCTCCCCT CACCATCTCC CCCCCCCTCT CCTCCACCCC CCTCCAACCC CATCCCCCAC 
CACGACCCCA CTCCTACACC CCCCGCCACA CCAGCTCCGG CCACCTTGCC GTACCCCCTC 

CCTCTCACCA CCAAACCCCC CCCTCCTTCC CAACACCCCC CTCAGCCCAG TCCCCTCACC 
CCACACTCCT CCTTTCCCCC CCGACCAACC CTTCTCCCCC CACTCCCCTC ACCCGACTCC 



CCCTTCCAAG CCACCCTCAT TTACCCTTCT TAACATCCTA GTCCAOGCAT CCCTCTCCCT 
CCCAACCTTC CCTCCCACTA AATCCCAACA ATTCTACGAT CACCTCCGTA CCGAGACCGA 

CATGCACGCA ACTCCAACCC CTCCTACAC 3' 
CTACGTCGGT TCACCTTCCC CACCATCTC 5« 



FLANKING SEQUENCES OF HA HAS LOCUS 
(NATURE, 302, 33-37) 



5' TCCTCCAACC CCCTTCCCCT CCCTTCCCCC AACTTCTACC TCTCCCCACA CCCACACACA 
3' ACGAGCTTCC CCCAACCCCA CCCAACCCCO TTCAACATCC ACACCCCTCT CCCTCTCTGT 

CCTCACTCCC CTCTCTCCTC ATCCTCCCTT CTCCTCCCCC CCCACACCCC CCCTGCTCCC 
CCACTCACCC CACACACCAC TACCACCCAA CACCACCCCC CCCTCTCCCC CCCACCACGC 

CCTCCCCTCC CACCGCCCCC CTTCACCCTC CCCCACCCCT CTCCCACCCG CACTTCTCCC 
CCACCCGACC CTCCCCGCCC CAACTCCGAC CCCGTCCCGA CACCCTCCCC CTGAACACCG 

CTCTCACCCT TCCCTATCCC TCACCTTCCG CCACACCTAC CACGCCATCC CCCTCCCTCG 
CACAGTCCCA ACCCATACCG ACTCCAACCC CCTCTCCATC CTCCCCTACG CCGACCCACC 

CCACCCCTCC AGCCAC MINISATELLITE 
CCTCCCCACC TCCCTG 

CACTCACCAC CTTCCCCATC CATACACTTC CCCACGCCAC CACCCCTCTA CGCCTCCCGG 
CTCAOTCCTC CAAGCCGTAC CTATCTCAAC CGCTCCCGTC CTCGGCAGAT CCCGACCGCC 

CTCCCACCCT CCCTCCTTCC ACACCGTCCT CCTCACTCCC TCCTCGCCGC GTCAGATCCA 
CACCCTCCCA CCCACCAACC TCTCCCACCA CCACTCACCC ACCACCCCCC CACTCTACCT 

CCTGACCCTC TCC 3* 
CCACTCCCAC ACC 5' 
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The invention will now be illustrated but not limited by reference to 
the following: 

ExamnT# 1 

MATERIALS AND METHODS 

<■> Preparation of genome bra. a n>n^T eoel d <j , h vbrid^M™ 
probes , 

Human DMA samples were provided by CEPH. Paris, or vera prepared from 
venous blood as described elsewhere 126). Oligonucleotides 
synthesised on an ABI 380B DNA synthesiser using reagents supplied by 
Cruachem were purified by cthanel precipitation and dissolved in 
vater. The 3.6 kb £gu.3A insert from human minisatellite clone MS32 
1 101 wi aubcloned Into the flajBHI site of pUCl3 [27]. Similarly, the 
alnisatelllte Inserts from recombinant M13 RF DNAs 33.1, 33.4 and 33.6 
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[5] were isolated as a 1.9 kb ££mHI*£££RX fragment, a 2.7 kb 
Sau3A-EcoRI fragra nt and a 0.7 kb fiajsHI-Ec. RI respectively, and 
subcloned into pUCIS dig st d with gamH I plus ££C;RI, to produce the 
plasmid series p33.1, p33.4 and p33.6. Appropriate 
minisatellite- containing DHA fragments were Isolated from restricted 
plasmid DNAs by electrophoresis through IX low gelling temperature 
agarose (Sea Plaque); gel slices containing DNA fragments were 
dissolved in water at 65° to a final concentration of 2^g/ml DNA. 
lOng aliquots of DNA were labelled with 32 p by random oligonucleotide 
priming (28] . 

(b) polymerase, chain m«i<?n 

Aliquots of human DNA, diluted if necessary with 5mM Tris-HCl (pH7.5) 
in the presence of 0.1 oligonucleotide primers as carrier, were 
amplified in IQpL 67mM Tris-HCl (pH8.8), 16mK (NH 4 ) 2 S0 4 , 6.7mM MgCl 2 , 
lOmM 2-mercaptoethanol, 6.7 aiM EDTA, 1.5mH dATP, 1.5mM dCTP, 1.5aM 
dTTP, 1.5mM dCTP (Pharmacia), 17C^jg/ml bovine serum albumin (DNase 
free, Pharmacia) plus lt/i of each oligonucleotide primer and 1.5 units 
Taq polymerase (Anglian Biolabs) . Reaction mixes in 1.5 ml Eppendorf 
tubes vera overlaid with 40^1 paraffin oil and cycled for 1 minute at 
95°, 1 minute at 60° and 15 minutes at 70° on an Intelligent Heating 
Block (Cambio, Cambridge). Final amplification reactions were 
generaly chased by a final step of 1 minute at 60°, to anneal any 
remaining single-stranded DNA with primer, followed by an extension 
phase at 70° for 15 minutes. 

(c) Southern blot analysis of PCR prgducta 

Paraffin oil was removed from PGR reactions by extraction with diethyl 
ether. Agarose gel electrophoresis of PCR products, Southern blotting 
onto Hybond N (Aaersham) and hybridization with 32p * labelled 
minisatellite probes were carried out as described previously [10], 
except that competitor human DNA was omitted from all hybridizations. 
Restriction digests and SI nuclease digestion of PCR products were 
performed by diluting 5^L PCR reaction with 25^ restriction 
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endonuclease or SI nuclease buffer (29] and dig sting for 3C minutes 
at 37 with 3 units restriction end nucleas of SI nuclease (BRL) 
prior Co gel electrophoresis. 

(d) Isolation and PCR analysis gf slnr.l? human cells 

Lymphocytes weie Isolate. 1 by diluting venous blood with a equal volume 
of lxSSC (saline sodium citrate, 0.15K NaCl, 15raM trisodiuin citrate, 
pH 7.0), layering over Histopaque- 1119 (Sigma) and centrifuging at 
2000g for 10 min> Cells at the interface were diluted with 3 vol 
lxSSC and banded again over Histopaque. Cells were pelloted by 
centrifuging at 2000g for 10 minutes, washed three times with lxSSC. 
with centrifugacion, and resuspended in LxSSC to 10** cells/ml. 

Buccal cells were isolated by diluting 0,5ral saliva with 5ml lxSSC and 
centrifuging at 2000g for 10 minutes. The cell pellet was rinsed 
three times with lxSSC and resuspended to 10 a cells/ml. 

Approximately 0.1^1 aliquots of the cell suspensions were pipetted 
onto a siliconised microscope slide and rapidly examined at lOOx 
magnification on an inverted microscope. Droplets containing a single 
nucleated cell were immediately diluted with 0.4^1 lxSSC and 
transferred to an Eppendorf tube using a disposable tip pipette. The 
microscope slide was re-examined to check that the cell had been 
removed with the droplet. 

Cells were lysed prior to PCR either by heating or by treatment with 
sodium dodecyl sulphate (SDS) and proteinase K [23]. In the former 
case, the cell droplet was diluted with 4.5^1 5mH Tris-CHl (pH7.5) 
containing O.l^N oligonucleotide primers, overlaid with parafffin oil 
and heated at 95° for 3 minutes prior to the addition of 5^1 2x 
concentrated PCR buffer/primers/Taq polymerase and amplification. In 
the latter case, the cell droplet was mixed with 0.5^1 SmK Tris-HCl 
(pH7.5). O.ljrfl primers plus 1J. 5mM Trls-HCl <pH7.5), 40mM 
dlthlothrelthol, 3*4 SDS, 50 ^g/ml proteinase K (23], overlaid with 
paraffin oil and digested at 37° for 45 minutes. 3«4> water were added 
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Tho yiolde of each 1MS32 allelo amplified by Kit wero quantified by 
scanning densltoaecry (Figure J). PCR produce* froa 0,1 ^ gonoale 
DKA accumulate exp none lolly oe looot up to oyolo 17. 

Tho gain In product por oyolo doorooooo monotonously with ollolo 
length, with lowor go Ins for 6 minutes eoaparod with IS alnutee 
extension tiaoi. Tho g*m versus allele lon|Ch curvoi oxeropoloco 
bock eo o gain por oyolo of approxlaaeely 2.0 for very shore alleles, 
Indicating ehoe eho offteloncy of denacuraclon ond priming oe ooch 
oyclo lo closo eo 1001 Final yloldo of on ollolo con bo calculated 
from ehooo curve* ; for or* ollolo A wieh goln g A por cyclo protone 
Inleiolly oe n molecules, eho ylold ofeor o oycloo lo approximately 
n g A c molecules. Tho aolir imbalance between olloloo A ond B of 
dtfforone lengths, orlolng through more efficient amplification of 
shorter alleles, Is given by (g*/gg) c oxaaple. ofeor 10 cycles 

of oapltf Icoelon vlch IS minutes extension tines, tho aoler yield of o 
1 kb ollolo will bo 1ft times higher then ehoe of o 6 kb allele; after 
2S cycles, tho labolonco will bo I 300- fold. This labolonco Is 
eaelloroced eo soae extent by eho more efficient detection of longer 
olloloo by tho alnlseeelllce hybridisation probe. Nevertheless, long 
olloloo Amplified by PCR will become increosingly difficult eo detect 
with low amounts of storting human genoaic DKA ond high numbers of PCR 
cycles. 

Mlnlsocolllees pJLg3, prtSSl, 33.1. 33.4 and 33.6 were also coscod for 
eholr ability to bo amplified by PCR (daea noe shown). In all cases, 
faithful amplification of all alleles coscod was observed, except for 
eho longest (> 8kb) alleles of p>g3 which as oxpocced failed to 
aapltfy. Again, yields of PCR product foil with increasing allele 
longeh. 

(C) Fidelity of amplification of etnylo mintaatell If aoleeulea 

To uest whether faithful amplification of single molecules Is 
possible, 60 and 6 pg all quo to of huaan DNA, equivalent to 10 and I 
colls, woro co*eapllfled for 2S eyeloo using p timers for both XHS32 
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and pMSSl (Figure 4A) . Both alleles of pMSSl (1.6 and 1.5 kb) 
amplified In cho tOpg sample, and Amplification produoti of ono or 
b eh alloloa were alio aaon In cho Ipg samples, Indicating that itngla 
target molecules can bo faithfully amplified. Similarly, tho 2.1 and 
S.9 kb alloloa of *NS32 could bo auocoaafully aapllfUd from 6pg 
eamplea of human DNA, although tho ylold of tho largor allolo waa aa 
oxpoctod vory low. Succoasful amplification of NS32 and pHSSl 
alloloa in 6pg samples appaarod to occur Independently, aa oxpoctod, 
with a mean falluro rata par allolo par raactlon of 63X. Proa tho 
Polaaon Attribution, thla Indloatoa on avorago 0.46 luccoaaful 
amplification events par 6pg DMA aaaploa, coaparod with I ovont 
pradlctod alnco 6pg human DNA will on avorago contain ono aolaculo of 
an allolo. Thus, single target alnlaatolllto aoloculoa can, with 
reasonable offlclancy, ba aapllflad by PCR. No apurloua amplification 
producta voro soon with pMSSl . In contrast, MS32 frequently gavo 
unaxpoctad producta in both tho (Opg and tpg DMA aaaploa 
(Plguraa 4A,B). SI nuclaaao dlgoatlon ollmlnatad ono band (Flguro 43) 
which appears only occasionally, which comlgrataa with danaturod PCR 
product and which can bo largoly ollmlnatad by chaalng tha PCR 
producta by a final annaallng/oxtonalon atap (data not shown, see 
Matorlal and Mathoda). Thla SI nuclease* sensitive band presumably 
corroaponda to single* stranded tomplato which fallod to prims In the 
final PCR eye Is, Tho remaining apurloua banda dotoetod byAMS32 vsra 
roslatant to SI nuclaaao but were roducod In also, along with tho 
corrocc PCR product, by dlgoatlon with roacrlctton ondonucloaaoa which 
cloavo DNA flanking tho mlnlaatolllto (Flguro 4B). These apurloua 
banda presumably represent abnormal PCR producta with normal flanking 
DNA but altered numbera of alnlaatolllto repeat unite. They are 
particularly prominent after 30 cycles of amplification, are generally 
preaenc In low amounts compared In the authentic allele, and vary In 
length from reaction to reaction. In contraat to tho parent allele. 
They are not the result of contamination of the PCR reactlona with 
human DNA or with producta of prevloua PCR reactlona, alnco they only 
appear In reactlona where aucceaful amplification of an authentic 
allele has occurred (Flguro 4A) and have been consistently seen with 
all human DNAa taated (data not ahown). Since almoat all of the 



« 54 • 

spurious produce* are shorter thin the authentic allele, tc Is likely 
that they erieo fairly early In the FCR roeetlon and accumulate 
preferentially due to their short length end ceneommltant hl|her 
efficiency of amp 11 fleet ton. It le not yet cleer how these "mutant" 
alleles arise, nor whether FCR conditions can bo found which will 
suppress their appearance. A stellar frequency of appearance of 
abnormal "allelce" has been eeen with pXg3, but only ouch less 
frequently with the other four alnlaatellltcs tested. 

Somatic nutations at •Inlstatelltte loci In the starting human genomic 
DMA could also be e source of unexpected <*CR products. Such mutations 
do exist, particularly for >MS32, as shown by the appearance of 
mutant alntsatellllte alleles In clonal tunour cell populations [30]. 
Soaatlc autants are however unlikely to be a aajor source of the 
spurious bands shown In Figure 4, since no FCR reactions on 6pg human 
OKA have yet been seen which show a autant allele appearing in tho 
absence of the noraal parental allele. 

(d) Co-anollflcatlon of multlpU alnltarellleea: PCT.derived hu^an 
DMA flngarorinri 

Figure 4 deaonstrates that two alnlsatellltes can be successfully 
co-amplified In the saae PCR reaction. Further analyses shoved that 
at least six alnlsatellltes could be co«ampilfied without any apparent 
Interference between loci. Furthermore, the FCR products could also 
be typed slaultaneoualy by Southern blot hybridization with a cocktail 
of all six alnsacelllte probes. Examples of such mult I locus 
FCR*derlved OKA • fingerprints' 1 are shown In Figure 5. In all cases, 
the FCR reaction was limited to 15*18 cycles, to minimise the 
appearance of spurious products as seen in Figure 4. Repeat analyses 
of the saae Individual showed that the pattern was reproducible, with 
all hybridising OKA fragments representing authentic alnisatelllte 
aapiliicatton products. OKA "fingerprints" could be readily derived 
from Itig huaan OKA; On occasion, one or two loci failed to amplify 
(individuals I, 12, Figure 3A, C); this failure usually affected 33.4. 
followed by pMSSl, and warn least likely to affect 33.1 (data not 



•hown). The ilkellho d of failure appears to correlate with the CC 
content of the mlnlsatelllte repeat unlet (Table 1), and suggests that 
non* amplification results fro* failure co denature CC-rtch 
mlnlsatellltco «c 9S° t probably due to localised variations of 
temperature In tho heating block or to poor thermal conductivity 
between tho block and tho reaction tube. 

These KH DNA fingerprints aro derived from sU loci vith widely 
differing leva It of variability (Table 1). To determine the overall 
complexity and level of variability of these pattern*, unrelated 
Individuals were compared (Figure SB). On average, 1.9 bands were 
resolved per Individual (range 6* 11. Figure t). The maximum possible 
number of bands Is 12 (Figure SA), corresponding to heterozygosity at 
all loci, vtch no electrophorectc comlgratton of alleles from 
different loci and with no alleles too large to be amplified by PCR. 
In palrvlse comparisons of unrelated Individuals, there are on average 
10.8 bands which are discordant between pairs of Individuals (range 
5*18), Sine the distribution of discordancies approximates to a 
Pols son distribution, then the chance that two unrelated Individuals 
would show Identical DMA fingerprints (no discordant bands) can be 
estimated at «* l0 <* - 2 x 10* V These patterns therefore show a good 
degree of Individual* spec If Iclty, despite the fact that four of the 
six loci used show relatively modest levels of variability (Table I) . 
Differences In FCR-derlved OKA fingerprints are also readily 
detectable between closely related Individuals. In for example the 
3* generation family shown In Figure SC in which faithful transmission 
of bands from parent to offspring can also be seen. 

(e) Tvptny «lnli«f lllc«« in ifnrl* humtn olla 

Lysed cells can be subjected directly to PCR without the need for 
purifying DMA (23] 9 which greatly simplifies the analysis of specimens 
such as blood. In preliminary experimaenta it proved possible to type 
mini satellites In blood by first fretting and thawing blood to lyse 
erythrocytes, followed by centrifugation to collect white cells and 
nuclei, and heating in water t lyse cells prior t FCR. Using this 
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aethod, it vat possible t reproduelbly type 0.001*0.01 j|i blood, 
corraaponding c 3O0 nucleatsd colli (data not shown), This analyeia 
was oxtondod to atnglo lyaphocytaa (Flguro 7A) froa which flvo 
ainiaatailite loci vara alauleanaoualy co«aapilfied and typad by 
eaquantlal hybridisation, Succaatful and raaaonably faithful 
aaplif lcatlon wae aaan both froa calla lytad with protalnata K and SOS 
prior to PCR t and froa calla lyaad by heating In water. Individual 
nucleated buccal calla could alao ba typad following protalnaaa K/SDS 
lyala (Pigura 76), though thaaa calla failad to lyaa in vacar (data 
not shown). 

To taat tha faaalblllty of ldantlfylng individual calla, 14 buccal 
calls froa two Individuals wars saparataiy typad in a b Undid 
axparlaant (Figura 11). In four casts, no aaplif lcatlon products wtra 
aaan froa any of tha loci, suggasting tithtr that tha call had not 
baan transfarrad to tha PCR reaction, or that lysis had not occurrsd, 
or that nuclaar DNA had dagradad prior to PCR. In tha remaining 10 
casts, aaplifitd allalaa could ba dattcttd froa at laast two of tha 
ainiaatallita loci, and in sent casas ail flva loci aaplif lad 
auccassfully froa a singla call* Oaitting tha iarga allalaa of 
HS32. which aaplif y poorly and would ba difficult to typa at tha 
ainglt call lavel, wa astiaata that, for thosa alngla call FCR 
rase c tons in which at laaat aoaa loci hava aaplifitd, approximately 
75X of allalaa prasant could ba datactad following FCR. This astiaatt 
agraas with tha afflciancy of alngla aolacula aaplif lcatlon determined 
froa FCR analysis of 6pg aaapla off huaan OKA (Figura 4), As axpactad 
froa Figura 4, aavaral Inatancaa of spurious bands wara a«an in both 
bucceal call and lymphocyte FCR reactions (Figura 7A,B). 
Nevertheless, distinguishing allalaa froa aach of tha two individuals 
tasted could ba datactad in tha 10 auccassfully* typad buccal calla, 
and tha origin of aach buccal call was successfully predicted in this 
blinded trial. 

Finally, wa note tha praaanca of aaplif lad producta of 33.6 In aoaa of 
tha DNA-free controla in Figura 7A, B and 33.4 tn ona of tha buccal 
call controla. In practica, wa hava found that auch contamination, 
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probably with recombinant DBAs or the produces of previous PGR 
rssetlons rsthsr than with human oslls or genomic ON A, can only bs 
s voided by using solutions! glassware, disposable plpsecs dps snd 
Eppendorf tubss which hsvo not previously bscn exposed to the 
Uboretory environment. It Is noteworthy thst our tost conslstsnt 
contamination problea hss been with J).*, one of the multilocua DMA 
fingerprint probee (S,<| which hsve been In continuous use In our 
laboratory for the last four years. 

(f) CONCLUSIONS 

Taq polymerase not only shows remarkable fidelity In raplifying 
non* repeated DMA (21), but Is alio capable of faithfully amplifying 
entire mlnlaatellltea and preserving the allelic apeclflclty of the 
number of repeat units. Unlike conventional PCI reactions t howsver. 
mlnlsatelllte PCR usually hss to be terminated before the yield of 
product becomes so high (Mng per 10^1 PCR reaction) that 
out-of •register annealing between complementary tandem* repeated 
template strands occurs during primer extension* particularly during 
the lengthy extension times needed to obtain efficient amplification 
of long mlnlsatelllte alleles. Also, the PCR reaction must proceed 
far enough to generate sufficient product to be detectable by 
hybridisation. The mlnlsatelllte probes are sensitive and can readily 
detect O.lpg mlnlsatelllte PCR product (101, The "window" of PCR 
cycles which generate an appropriate amount of product for typing 
(0*1*4000 pg product) Is therefore very wide, and only a very 
approximate estimate of the amount of initial human genomic OKA is 
needed to predict the number of PCR cycles needed for successful 
typing. As a gulda, 10*15 cycles are appropriate for lOOng genomic 
DNA, It cycles for Ing and 25 cycles for single cell PCR (6pg), The 
number of PCR cycles may need to be increased to deteet larger alleles 
which amplify less efficiently. 

Since minisateilite amplifications often need to be restricted to the 
exponential phase of accumulation of PCR products (211, then the 
hybridisation signal is approxlmatsly proportional to the amount of 
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Input DNA down c O.lng human genomic DMA (data not shown), Ba low 
this I aval, stochaatlc variation in tha nuabar f target mlnlsatelllto 
molecules can obacura tha proportionality, Hlnlaatailita PGR can 
tharafora bo uaad quant 1 tat Ivaiy to estimate lov eoneantrationa of 
human DNA. Al«o, tha amount of prlmara and Taq polymaraaa will not ba 
limiting during this oarly phaaa of PCR, and in prlnelpia thara should 
ba llttia or no intarfaranea batvaan 

dlffarant loci balng ampiifiad. In practice, at iaast alx dlffarant 
mlnlaataiiltas can ba co*ampilfled simultaneously, and thara aaams to 
ba no thaoratleal raason vhy this numbar could not ba incraasad 

fur tha r. 

Co*aapllflcation of mlnlsatallltas followed by slmultanaoua or 
sequential hybrldltatlon with mlnlsatalllta probaa anabias a 
conaidarabla amount of information concerning individual identity and 
family relationship to ba gatharad from vary small DMA samples. These 
PCR -derived DNA fingerprints appear to be reliable dovn to Ing human 
DNA. Information can also be recovered from much lover amounts of DNA 
and single colls, although the generation of spurious DNA fragments at 
some loci, during the relatively large number of PCR cycles needed for 
single cell typing, could presont significant problems for individual 
identification at the level ox one or a few cells. Fortunately, these 
spurious PCR products appea/ to vary from reaction to reaction, and 
duplicate PCR analyses of very small samples of DNA should therefore 
distinguish bona fide amplified alleles from spurious PCR products. 

PCR* derived DNA fingerprints already show a good level of individual 
specificity, with a chance of false association of two individuals of 
approximately 2 x W'V In contrast, conventional DNA fingerprints 
obtained by Southern blot hybridisation with a aultiiocua polycore 
probo (6) or with a cocktail of locus-specific human minlsatellite 
probes (10] show much higher levels of Individual specificity (<10* 12 
and <1<T* respectively). However, tha variability of PCR-derived DNA 
fingerprints could bo Improved substantially. First, highly 
informative alleles particularly at pig) and VfS32 cannot be detected 
above approximately • kb (Ing human DNA) or approximately 5 kb (single 
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€•11). This probles eeuld be overcome by using highly variable 
alnlsatellltes with a tors rsstrieted rsngs of sllsls Isngths. Such 
1 cl appear co bo scares sines high Isvsls of variability ars uaually 
assoetatad with Urge nuabera of alnlsatelllte repeat unite and long 
allslss [S, I, 10), Som possibly appropriate loot havs however bsan 
Isolated (9), J. A, L, Amour and A.l Jeffreys, unpublished data)* 
Second, the number of alnlsatallltss being amplified simultaneously 
could be increased. Third, loci which are particularly prone to 
generate spurious PCR products, such as pJgJ and HSJ2, eould be 
Identified and avoided. If thess goals can be acceapltshsd, thsn ve 
see no reason why reliable identification at the tingle cell level 
should not be posslbls, provided that Inadvertent contaalnation is 
avoided and that the potential pressnce of soaatlc autatlons at 
hypervarlable loci Is taken Into account (301. 

The uss of aultllocus OKA fingerprint probes In for exaaple Individual 
Identification In forensic aedlclne, paternity testing and monitoring 
bone marrow transplants Is Halted by the ssnsltlvlty of these probes 
which require at least 0.1*lj|g human DNA for typing (6). Similarly, 
locus •specific alnlsatelllte probes can only type down to 
approximately SOng human DMA [10]. PCR- derived OKA fingerprinting 
Improves sensitivity by orders of aagnitude and can be used to type 
speclaens vhlch ars relatively intractable by conventional Southern 
blot hybridization. For exaaple, human hair roots typically contain 
I0*500ng DNA (22) and while approxiaately 70X of roots can bo typed 
using locus-specific alnlsatelllte probes (2, Wong. J. A.L. Armour and 
A.J. Jsffrsys, unpublished data), all hair roots so far tested can be 
typed by PCR-derived DMA fingerprinting (data not shown). Similarly, 
0.001-0. 01^1 blood can be typed without the need first to purify DNA. 
Likewise, saliva contains typically 100*1000 nucleated buccal cells 
per pi, and FCR*dcrlved DNA fingerprint analysis of subaicrolitre 
saaples of saliva is therefore possible. The potential for typing 
tracs aaounts of hair, blood, a amen, saliva and urine in forensic 
speclaens, including partially degraded saaples. Is obvious. The 
potential for Inadvertent contaalnation of speclaens, for exaaple with 
traces of spittle. Is likewise evident. 



• 62 • 

PCR« derived DMA fingerprints should eventually become sufficiently 
Individual* spoolflc to provlds a highly polarlssd tsst for 
astabltahtng parantago In for example patarnlty disputes. Not only 
would tha naad to Isolate OKA be obviated, but auch smaller sanplts of 
blood obtained by f lnger*prloklng rather than venipuncture could ba 
used. Alternatively, the determination of parentage could be based on 
the analysis of saliva. This would avoid the problem of individuals 
who object to giving blood aasiplea on religious or other grounds, and 
would remove the trauma of taking blood from Infants. 
EMBBlt I 

S&TKRIALS AND METHODS 

AnMlflcatlon of minlaatalllto all«l»i bv PCft 

Oligonucleotide prlaers ware prepared as described previously 
'.Jeffreys tl Al. Nucleic Acids Res, J£, 10953*10971). *MSJ2 
minlsatelllec allalas were aapllflad by mixing 0.4^g human genomic DNA 
In <4_1 H 2 0 with 50^1 43nM Trla-HCl <pH8.8). UmM (NH 4 ) 2 S0 4 , 4.3mK 
MgCl 2 , A.S^t EOTA. 7mM 2«aercsptoethanol % ImM dATP, laM dCTP. ImM 
dCTP 9 ImM dTTP (Pharmacia), 110^g/al bovlna serua albumin (DSase free. 
Pharmacia), l^fi oligonucleotide prlaer A and l^C ollgonuclaocida 
prlaar fi. 7.S unite Taq polymerase (Caablo type III or Perkln Elmer 
AapliTaq) were added, the reaction overlaid with 40^L paraffin oil, 
and cycled at 95° for 1.3 minutes. 67° for 1.3 ainutes and 70° for 9.9 
atnutea for 30 cycles on a Techno Programmable Drl*Block PHC-1, 
followed by a final chaae at 67° for 1*3 minutes and 70° for 9.9 
minutes* Equivalent results ware obtained on a Parkin Elaor Cetuj DN'A 
Thermal Cycler, cycling at 93° for 1 minute, 67° for 1 mlnuce and 70° 
for 10 minutes. 

Isolation of amplified allelei 

50 J. PCR reactions were mixed with SJ, loading mix (12* St flcoll 400, 
0.2X bromophenol blue. 0.2H Trie acetate (pH3.3), 0.1M Ha acetate, ImM 
EOTA). loaded onto a IX agarose gal (Sigma Type Z) and electrophoresed 
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ln th. pr...ne. of .thldlu. bfldo. A^llfl- AUU. *«. 
vl.uall..d u.lng a ion, v.v..l.n,th u/v wand (or a *•>• v.v.l.n, h 
^ tran.lllu.in.tor if «.c..ry> and g.l .He., .ont.ln «. .ILL. 
*.r. .«ol..d. OtU wa. r.eovor.d by .l.ctroph.r..l. onto dUly.l. 
..ubr.n*. .th.nol pr.clpltat.d and dl..olv.d In 10,1 H 2 0. 
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U. hav. pr.vlou.ly d..ctlb.d how .Inl.at.lllt. .ILI« «« b. 
^pllfl.d u.ln, ollgonucl.otld. prl..r. corrcpondln, to 0»A fLnktng 
th. .InL.t.lllt. to drlv. th. -pllflctlon of th. .ntlr. t.nd.» 
r.p..t .rr.y. By lUltln, th. nu-b.r of PCR cycl.. .nd d.t.ctlng 
^Ufl.d .11.1.. by Southern blot hybrldla.tlon vleh . .lnl..t.lllt. 
prob.. v. .howd that faithful -pllflctlon «.. po..lbl.. but th.t .1 
high eycl. nu»b.r. th. product. eoll.p..d to . h.t.rodl.p.r.. — «r of 
upllfl.d product, du. to «nn..llng b.tv..n .lngl...tr.nd.d 

t.nd..r.p..fd .InL.t.lllt. ONA fr.g«.nt.. Al.o. «o.t of th. ?CR 

product. d.t.ct.bl. .t high cycl. numb.r. on .n .thtdlu- bro-ld. 

•t.ln.d agar... g.l did not corr..pond to PCR product, of th. 

. ln l..t.Ult. locu. but «... by .l.prl.lng .t oth.r g.no.lc .It.. (A. 

J, J.f fr.y. »nd V.Ull.on. unpubll.h.d data). 

By d.cr...ing th. lonL .tr.ngth of th. PCR buff.r .nd lncr.a.ing th. 
KR .nn..ilng t..p.r.tur. to r.duc. .LprLlng. a. v.U changing 
th. .oure. of th. Taq poly^ra... It h« provd po..lbl. to amplify 
.tnl..t.llit.. to th. point vh.ro .uth.ntle PCR..*)lifLd .U.I.. c.n 
bo dlr.ctlv vt.ualL.d on .n .thtdlu. .t.ln.d g.l (Pigur. 9). So«. 
background .-.ring vhleh r..olv. into dl.cr.t. low .ol.cul.r wight 
DMA fr.p».nc. do., occur. .. . r..ult of th. l.rg. nu-b.r of PCR 
ey el.. n..dad to g.narat. thl. -ount of product uhlch produc... by 
.nnaallng. h.t.rodi.p.r.o .lnl.at.lUt. product.. H.vrth.l... . 
.uth.ntic allol.. ar. cl.arly idantifiabl. and eorro.pond in .l«. to 
tho .ini..t.mt. allol.. d.t.ct.d by conv.ntion.1 South.m blot 
hybridisation of huaan g.noalc DNA. 
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aero ONA controls were olio negative (data not ah vn) . From these 
data, 421 of tlnflo mlnisatelllte molecules |ivt rlao to PCR products, 
an of f leiency olmiiar to that reported previously (Jeffreys tt si. . 
1981) and olmiiar to the efficiency of amplification of larger alleles 
(Figure 12A legend). The 11 alleles amplified from single molecules 
were re*amplifled with the nested pr inert CI and D, to minimise 
subsequent contamination of single molecule PCR reactons and to 
eliminate spurious products arising from mlsprlming with primers A 
and/or B elsewhere in the genome, and subjected to internal mapping. 
All maps obtained were indistinguishable from the maps of alleles i 
and 4 obtained by amplification from bulk genomic DMA (data not 
shown) . 

topUficitlon of ntv dtlrctUn imtants of from slst* i«Uc:sd 
huaan itnoalc m\ 

The ability to amplify faithfully single molecules of MS32 make it 
possible to amplify and characterise && novo mutations at this locus 
present in bulk huaan genomic ONA. An individual was chosen who was 
hotorosygous for alleles of similar length but very different internal 
structure (alleles 31 and 32, Figure 12). PCR analysis on 40 
duplicate aliquots of 30 pg sperm ONA from this Individual shovod 
efficient amplification of single mlnlsatelllte molecules with no 
evidence for additional abnormal length alleles in any reaction 
(Figure 13A). 

Sperm ONA from this individual was digested with £*u3A plus the 
Isoschlsomer tfh&l. which cleave distal to primers A and B (Figure 
llA), and else fractionated by agarose gel electrophoresis. A series 
of fractions containing OKA fragments smaller than the progenitor 
alleles 31 and 32 were collected* and aliquots of each fraction 
amplified with primers A plus B* Southern blot hybridization revealed 
single molecule amplification events in these fractions, giving 
discrete PCR products whose else fell within the also range of the OKA 
fractions tested (Figure 13B, Table 2), Multiple parallel control PCR 
reaction containing no DMA, or containing aliquots of equivalent size 
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fractions froa a control SiuSA/ftftl digest lacking spers DNA (sot 
Experimental Procedures), were consistently negative f r MS32 PCR 
products. Tho saallose slse fractions S3-SS froa opera DNA voro 
completely devoid of aapl if labia precursor alleles. Indicating thst 
>99.9999t of precursor allele aolocules hes been ellalneted froa these 
fractions. Trace aaounts of these alleles voro still present tn 
fraction SI ana S2 Indicating tncoaplete purification despite repeated 
rounds of gel electrophoretlc fractionation. Cel purification of DNa 
close In else to the progenitor alleles yielded aore heavily 
contaalnated fractions in which the detection of new mutant alleles 
was hindered by PCR products froa the progenitor alleles (dsts not 
shown) . 

Single aolecule alnlsatelllte deletion autants were similarly 
aapllfled froa sirs- fractionated DNA prepared froa blood taksn froa 
the saae Indlvdlual. Again, single aolecule Amplification ovonts wore 
rsadlly detectable In blood DNA, but not In control digests lacking 
blood DNA. Details of the nuaber and sizes of new mutant sptrn and 
blood alleles detected are suaMrlred In Table 3. A total of 64 spera 
and 42 blood autants wore observed, with mutants occurring most 
frequently In the largest size fractions. 

The sblllty to align all spera and blood autants with ono or ocher 
progenitor allolo provides strong supporting evidence that thsso short 
alleles represent bona f Idq autant aolocules present in genoalc DNA. 
rather than contaminants Introduced during the Isolation of spera and 
blood DNA. Furthsraort , the repeated detection of Identical mosaic 
autant alleles further confirms the fidelity of single aolecule SWR 
mapping. 

EXfERMEHTAL PROCEDURES 
Prtpirstlon of DNA 

Huaan DMAs were provided by CEPH, Paris, and were also prepared froa 
venous blood collected from 80 unrelated English Individuals as 
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described elsewhere (Jeffreya «nd Morton, 191?). Blood DMA for single 
•olieuli PCH analysis vat preparad from 20ml bio d collected Inc 20al 
1 x SSC (saline sodium citrate, 0.13 NaCl, ISmtt trlaodium citrate pH 
7.0). The blood waa froaon and thawed to lyat erythrocytes, and 
leucocytes oollaetod by csntrtfugatlon at 10,000* for 10 minutes. Tha 
coll pallat waa rlnsod with I x SSC, resuspended In 0 2M Ma acetate 
(pH 7.0) and lyeed by tht addition of SDS to IX. OKA was collected, 
following phtnol extraction, by throa rounda of othanol precipitation 
All oparatlona vara performed In a laminar flow hood ualng pipettes, 
dlapoaabla plaatlewara and reagents which had not boon previously 
exposed to tho laboratory environment, to minimise tha risk of 
contamination. Spana OKA was similarly praparad (torn a single 
ajaculata from an Individual who had avoldad sexual intercourse prior 
to giving tho aaapla (to prevent contamination with DMA from his 
partner). Samon waa diluted with an aqua I voluaa of i x SSC end sperm 
collactad by cancrlfugatlon at lO.OOOg for 10 minutes. Tha coil 
pallat was rinsad thraa times with 20ml 1 x SSC. with centrifugation. 
followod by alx washea with 20ml i x SSC. IX SDS to iyso any seminal 
laucocytas and opithalial colls. Tho residual sperm pellet was 
Incubated In 1 ?< SSC, IM 2*mercaptoethanol at room temperature for 5 
minutes, and the reduced sperm lysed by the addition of SDS to IX. 
Sperm DMA was collected after phenol extraction by ethanoi 
precipitation. 

SI20 fractionation of human P?U 

4C^g blood sperm DMA was dlgeated to completion with Saul A plus ttbo l 
in the presence of AmK spermidine trichloride using conditions 
recommended by tho manufacturer. Tho OKA digest, together with a 
control dlgeat lacking DMA, waa loaded Into a horizontal 20cm long 
0.6X agarose gel (Sigma typo I) In TSE (20mN Trla acetate, lOmM Na 
acetate, O.lmH EOT A, pH 8.3) such that the loading aloe was less than 
half filled with sample. Samples were electrophoresed alongside 
HJLnd III OKA markera at 20V for 18 hours. Tho marker tracks were 
removed and stained with othldlum bromide. Tha gal region containing 
human OKA fragments ranging from 0.8 • 4,2kb was excised, along with 
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tha e rrasp ndlng rsgl n froa tht control digest, snd DNA e iioctsd by 
electro-elution once dialysis aaabrtna. The recoverad DMA, plus ths 
c ntr 1 extrsct, wsa re fractionated as abova on t frash gsl. 
Rac vared DMA *ms then aiactrophorastd through a third gti, snd a 
iirtci of gsl slices froa 0.4 • 3.8 kb taksn. Tho cop snd bottom 1mm 
of go I was trlaaad froa sseh sites to minimise contamination with DNA 
frsagsnts migrating absrrantly along ths gsl surfaca, and DNA 
collsctsd by elect roe iutlon and sthanol prsclpltatlon. DNA was 
dissolved in 3aN Trls*HCl (pH 7.5). All opsrstlons vors psrformed In 
a laalnar flow cabinet, and slsetrophorssls squlpasne vss 
dscontaalnatsd prior to uss by iaaorslon In doaostlc blsaeh plus 1M 
HCl In s fuas hood for 16 hours. 

AwUflcitUn of MS32 alltlta 

Allolss wsrs aapllflsd by a aodlficatlon of previously described 
as t hods (Jsffroys li ai. 1981). 0.4^g human gtnoale DNA was amplified 
In 50^1 45mM Tris-HCi (pH 8.8), llaM (NH 4 ) 2 S0 4l 4.5mM MgCl 2 . 6.7mM 
2-marcaptoathanol, 4,3^M EDTA. lmM dATP. loM dCTP t laM dCTF , ImM dTT? 
(Pharmacia), 110 pg/al bovlns ssrua albumin (DNaso froo, Pharmacia) 
plus IpK oach of ollgonucisotldas CI plus 0 9 or C plus Dl, and 5 units 
Taq polymerase (Aasrshaa). Reactions wsrs cycled for 1.3 minutes at 
96°, 1 alnuts at 67° and 4 minutes at 70° for 25 * 30 cycles on a DNA 
Thermal Cycler (Parkin Elmer Cetus). PCR products wars 
elactrophorasad through a 0.8X agarosa gal, and amplified ailalas 
visualised by staining with ethidium broalda. Alleles were recovered 
by eiectro-elution and echsnol preclplttelon and dissolved in 5mM 
Tris*HCi (pH 7.0); Other sourcas of Taq polymerase wars squally 
sffsctlva (Amp I i Taq (Parkin Elaar Cetus) and Type III Taq polymerase 
(Caabio).) 

Huaan DNA samples for singia aolacula analysis wara csntrlfugad to 
remove any particulate matter, diluted with SaH Trls*HCl (pH 7.0) plus 
O.i^M PCR primers as carrier, and aapllflsd In 7^1 PCR raactions using 
prlaars A plus B at 96° for 1,3 ain t (4° for 1 alnuta and 70° for four 
alnutas for 25 * 28 cycles, PCR products wara datactsd by Southern 
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blot hybridisation with • Up-Uballtd NS32 Blnlsatalllta proba as 
d#«rlbad pravioualy (Jaffrays gf ti> , 1988} PCR produces vara 
raaapllflad by aaadln| a 30ajl PCR raaetlon containing l^M nested 
prlaara CI Plua D. or C plus 01. with 0.4^ of tha Initial PCR 
raactton and cycling at M° for 13 alnutas. aS° f*r 1 mlnuta and 70° 
for 4 alnutas for 23-30 cyclos. Ra*aapllfled allalaa vara purlfi.d by 
agarosa gal alacerophorosts aa dascrtbad abova. 
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T*M« l la tend Spans and blood DNA froa an individual hetsrosyg us 
for NS32 loiSA aUtUs 4.9 and S.l kb long (alleles 31 and 32, Figure 
12) wars digested wich SluSA plus &&t and slse- fractionated by 
agarose gal electrophoresis as daacribad in Experimental Procedures . 
The yield of DMA in each fraction vaa estimated by electrophoresing 
aliquot a of aach fraction against a dilution aeriaa of unfractlonated 
digested DNA, followed by scanning densitometry of the appropriate 
•oleeular might region on photographs of an ethidiua bromide stained 
gal. Multiple aiiquota of each fraction vera amplified using PGR 
prlaers A and B and tastad for nav mutant ailalas as described in 
Pigura 13. The number of haploid genomes analysad for mutants *f a 
given site class was derived froa the yiald of each fraction, assuming 
3pg DNA per haploid genome and corrected for the estimated proprotlon 
(40X) of single HS32 molecules which successfully amplify. The 
observed number of repeats in aaeh mutant molecule was estimated from 
the size of tht initial product amplified with primers A plus B, and 
confirmed by direct determination of the number of repeats in mutant? 
subjected to internal mapping. 
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rifura 1 Prlaers and hybridisation probes used in the aapllf leaclon 

f alnlaateilltes by PCX. Bach alnlsateillte 1 cut wa* aaplifled 
using 20* r H**tr pr Intra A and I located In unique sequences 
flanking DMA a and b bp respectively froa the alnlsateillte. PGR 
produces were detected by hybridisation with an Internal alnlsateillte 
probe Isolated by cleavage with restriction endonucleases X and Y 
which cleave c and d bp froa the alnlsateillte. Details for Che six 
ailnlsatallltes are as follows, where R- length of repeat unit (bp): 
p g3. A • 5 1 • ACCACACCCACACTAACACC O \ B • 3 • • CCACrCTCCTTACACCAATC 0 1 . 
X - tot I. Y • fidtt. a • 33, b - 58, c - 26. d • 45, R - 37; MS32. 
A - 5 * • TCACCCCTCAATTCCACACACACT • 3 * , B • 5* • AACCTCTCCATTTCCACTTTCTCC • 3 * . 
X - Hail. Y - axlL * - 181. b - 324, c • 10. d • 37, R • 29: pMS5l. 
A • 5' CATCACCCAACTrCCTCTCCCCTCO' . B - 5* .TCCACATTCACCACTCTCCCAACC- r . 
X - fife!. Y - Bail!. * - 117. b - 131, c - 10. d • 0. » - 25 ♦ 11 
pil l. A - 5 1 • CTTTCTCCACCCATGCCATCCCAC O 1 , 

B • 5' ■GCCCTCTCACCCACAAC C 7T C T C C>3 t , X - Qda l . Y • Rt^ I a • 6. b - 27. 

c • 0, d • 15, R • 62; p33.4. A - 5' .CCCCCCCACACCCCACCTGCTCAC- 3' 

B - 5* •CCACCATAGCCCCTCTCCTCCGCT.3\ X - Qdil . Y - fifeX, a - 11. b - 96. 

c - 0, d - 2. R - 64; p33.6, A - 5* •TGTGACTAGACGAGACCTCACATT* 3* , 

B - 5'.ACCTCACACATTACTCAATCCAAC-3'. X - £1*1 . Y - Dralll. a - U. 

b - 45. c - 10, d - 16. R - 17. 

Piftira 2 Aapllflcatlon of MS32 alnisacalllte alleles by PGR. A. 
0.1_g allquots of DMA froa CEPH Individuals 2306, 10208. 133101 and 
133304. which together contain 8 different MS 3 2 alleles ranging froa 
1.1 to 17.9 kb, were pooled and aaplifled for 10*20 cycles in ICyi 
reactions containing I unit Taq polymerase plus flanking prlaers A and 
B. PGR products were separated by electrophoresis in a It agarose gel 
and detected by Southern blot hybridisation with a alnisatellite probe. 
Taq polymerase extension Class at 70° ware for 6, 15 or 30 ainutes with 
(♦) or without (*) addition of extra polyaerase (1 unit) at the 10th 
cycle. H. 7p$ of each CEPH SNA digested with &iul; AluX sites flanking 
MS32 are located such that each &lu! allele is 0.2 kb longer than its 
corresponding PGR product. Autoradiography waa for 5 hours (cycles 10. 
14) or 1 hour (17, 20) without an intenalfler screen. B, effect of 
increasing concentration of Taq polymerase (a*c, 0.5, 1, 2 units 



• 76 . 

r spectlvely) on the efficiency f sapltf Icatlon of large tlitui. The 
osteons ton ttao at 70 woo IS ainutes. 

Flyur* 3 Efficiency f aaplif icatlon f MS32 ainlsetellite alleles *s 
o function of allele length, with PCR oxtenslon tlaes of 6 ainutes (0) 
or 15 ainutes (0). Tht gain In product por aaplif icatlon cycle vat 
deteralnod by scanning laser dens It oat try of tracks H, end 15* of 
F I curt 2, exposed to pre -flashed X-r*y film without an intenslfler 
scroon. Tho aoan estlasces of gain por cycle dottralnod up to cycle 
10. froa eyclo 10 to U and from cyclt U to 17 wort in clost 
agreeaent, indicating that tho ylold of PCR product is Increasing 
exponentially at loaat up to eyclo 17, tht Flgurt shows tht moan value 
of tho throo estlaates of tho gain for each allele. 

riguf »*» Co-amplification of two alnlsattliitts froa slrglt ctll 
oqulvalonta of huaan OKA. A, 60 or 6pg aliquots of ONA froa blood froa 
on individual heterozygous for elWtta a, b at XMS32 and c. d at ?MS5l 
wort aapllfled for 23 cycles with 15 ainutts txttnslcn tlats In tho 
prosonco of prlaors A and 3 for both loci, followed by Southern bio: 
hybridization analysis of aaplif icatlon products. Low level* of allele 
a could bo detected in three of tht 6pg saaples on prolonged 
autoradiographic exposure (arrows). B, analysis of spurious 
aaplif icatlon produces of MS 32. Two 60pg aliquots of DNA were 
aapllfied for 30 eyelet, followed by digestion with SI nuclease (SI), 
till (B) or Ua*l (H). £gll cleaves once in tho flanking ONA, between 
the MS 3 2 ainisatellite and prlaer B, and reaovts 31lbp of flanking 
OKA. Uogl cleaves between priaor A and the ainisatellite. removing 
I95bp of flanking OKA (soe Figure 1 legend). 

Flftirt S Co-aapiif icatlon of six different human ainlsatoilltos by 
PCR, A, aaplification of lOng (flrsc four lanes) or Ing OKA (last two 
lanoa) froa individual 1 for IS or 18 cyclos respectively, using a 
cocktail of prlaors A and B for ainistatellitea p>g3, KS32. pHS5i. 
33.1, 33.4 and 33.6. PCR products wore detected by Southern blot 
hybridization with a cocktail of all six 32 P- labelled ainisatellite 
probes, Tho individual tested had been provlously characterised at 
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all tlx I ci separately, which nebled ell hybridising OHA fragmentt to 
be assigned at shown; thle Individual Is heterosygous at ill six loci 
Thes DMA fingerprints are from three separate experiments. Note that 
33 4 has failed to amplify In tho last tract B. OKA flnforprtntt of * 
unrelated Individuals (2*9) following amplification of lng samples of 
DSA for 16 cycles C. DMA fingerprints of ) -generation family (CEPH 
kindred U35)» following ampllf Icat ion of lOng ONA for 15 cycles 
Thrto bands, corrtspondlng to allolos of J J 4 and pHSU, failtd to 
amplify in individual 12, as shown by a second analysis of this faally 
(first tract, bands marked with an asterisk) In all experiments. PCR 
products wars dtgostsd with SI nuclease tsee Materials and Mothods; 
prior to gel eiectrophoreala. ro reduce background labelling, SNA- free 
controls in all experiments wort consistently blank (not shown) 

Piyur* 6 Variability of DMA flngtrprlnts productsd by co-aapUfyIng 
six mini satellites simultaneously by PCR. Ing saaplos of ONA from 21 
unrtlatad Individuals were analysad as dascrlbad In Flgurt 5 A. 
variation in tha number of resolvable ONA fragments par Individual 
The moan number of hands rasolvod par dlffarancas seen batwaan tha ?N'A 
flngarprlnt of pairs of unralatad individuals, based on 29 independent 
palrwlse comparisons The number of discordant bands is the total 
number of bands not shared by the two Individuals being compared The 
theoretical maximum number of discordancies with 6 loci Is 24, ani the 
observed mean it ICS t 2.8 (S O ) The distribution of discordancies 
approximates to a Polsson distribution with this mean (dots). 

Fl|ure 7 Amplification of minisateilltes from single human cells A. 
samples containing 0, I or ) small lymphocytes (from the individual 
analysed in Figure 5A> were iysed either with proteinase K plus SDS or 
by heating in water, followed by co-ampiif icetlon with primers for 
MS32, pKSSl, 33.1, 33.4 and 33.6 for 2? cycles. PCR products were 
Southern blot hybridized sequentially with each of the five 
mtnisateilite probes. B, amplification products of single buccal 
cells, analysed following lyaia with proteinase K plus SDS and PCR as 
aoove. Cells from two individuals, a and b were tested; b is 
homozygous at pKSSl. 33.1 and 33. o, 0. no cell control. Spurious PCR 
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pr ducta ar indlcat d with arr *• 

fUutW fl Strueturo of plaaald el on* p*S22S ft m d r*giona r*pr***nt 
atniaacolltc* arraya Thar ar* two dlattnct ajiniaatall ita ragiona 
dlacornlbl*. 22«A d*t*cta tha mora atrongly hybridising banda jnd 
th* fatntor banda on high atr Infancy hybrid Hat ton of huaan ON a 

fHurt f Dlr*et d*t*c*ion of aapUftad HS12 atnluttl I It* all.;., v, 
•t*ctr*phor*aia on *thidtuaataln«d agaroa* g*ls following 30 cycl«» 
of PCR on 0 H H g human <anomt<: DNA M-«4r«*r wNA ,0 S^g 3NA x \Sl^A ::: 
• 0 : M , Oxl?* ONA x hultl) Indivtduai 1 i. h.t*roxygoua for th. .. 
kb allol* d*t*ct*d on tho g*l and a latg*r «ll*l. <: kb) which 
AapUfm ouch Uu .ff'ctantly and cannot th*r*for* ba d*t*ct*d 
Individual 2 la h*t*ro 2 ygoua for ) ' and - * kb at !•!♦». both of vhic* 
**• **«««**U of tho ^plifici <orr**pn P .i ;h „ 

prodlctad tlsad dataratnad by conventional SsutUrn blot 4r.4ly$:» o: 
g*nomlc DSA from th*a* individual! Nota th* pr«a«nc* of an addlTlor.al 
compl.x aat of low mol.cular v.ight PCR producta corresponding to 
collapaad oiniaat* 1 1 1 alloloa C - no ZUA control 

1ft »«P of tho aaall.r SS32 ail*!* In CEPH md.vldual 

10202 a. aiafl (F) and JLuli: , H ) partial dlg.at producta of th* ?CR 
amplified ali*l*. following cl*avag* with fic^RI and and- laballlng at 
th* IfiflRl alto 8. th* coaploa.ntary patt*rn from th* saa* alio'.* 
and- labelled at th* C^l nt* C. aap of ^il 4 nd frilll cleavaga 
altoa In tho allol*. Th* locatlona of PCR prlaers A and ft ar* ahovn 

FlW" H Stf««»7 for mapping th* location of variant rop«at units 
within an MS32 alnlsac.il i t* allalo. A. location of PCR prison in 
relation to th* ainxsaceilic* rapaata (op*n box* a) and aurrounding 
ratrovlral LTR hoaologu* (fUl.d box). Restriction sleaa for JJiafl 
(F). HftftXXI (H) and Siy.3A <S) ar* also shown. Prima r sequences ar* 
A. 5 ' - TCACCCCTCAATTCCACACACACT • 3 1 ; ft. 5 1 • AACCTCTCCATTTCCACTTTCTCC • 3 ' . 
C S'-CTTCCTCCTTCTCCTCACCCCTACO': 0. 5 • CGACTCCCACATCGACCAATGCCC - 3 ' 
Derivatives CI and 0i with a 5' extension TCACCCCT Caattc containing an 
•fflcl*ntly-cl*av*d &O.M sic. (undarllnad) v«r« used to gen. race PCR 
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4 A Mth d at cUtMd In any one of tht previous claims wherein 
Alinlti of en Informative locus in tht tost sample ore of up Co 15 
kllobases . 

} A method as claimed In any one of the prevlout clalaa which 
includes the use of at least one enzyme which specifically digests or 
degrades single stranded DNA whilst leaving double stranded DNA 
Intact, whereby to alleviate tho formation of aspeciflc amplification 
products 

*> A method as claimed tn any one of the previous claims -fhorotn 
amplification Is effected. in buffer of reduced ionic strength and ,n 
an elevated annealing temperature whereby to alleviate misprinting. 

?. A method as claimed in any one of the previous claims wherein 
more than one informative locus is amplified simultaneously. 

8. A method as claimed in any one of the previous claims for the 
characterisation of one molecule of an informative locus In a genomic 
DNA test sample. 

9. A polynucleotide hybrldlsable to a single strand of a test 
sample of genomic DNA at a region which flanks the mlnlsatellite 
sequence of an informative locus so as to permit formation of an 
extension product which is complementary to and spans the said 
minisatelilte sequence of the strand of the test sample and wherein 
the informative locus is selected from any one of the fol loving: 
MSI, KS29, MS31A and MS31B* MS32, MSA 3 A and MS43B, MSSl, MS228 and 
MS228B. 
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10. A p lynucls tide extension pr duct prepared fr m a 
polynucU tide primer hybrtdlaabla Co 4 single strand f a gan mic DNA 
teat sample at a region which flanka cha minisatei I its sequence of an 
informative locus and wherein tha polynucleotide extension product is 
complementary to and apana tha aald mlnisatel 1 1 te sequence of tht 
strand of tha taat sample 

11 A mixture which comprise* multipU faithful copies of a 
polynuc loot Ida extension product .is claimed in cUla 10 above 

12 A mixture as cUitttd in cUta 11 4*>ov» which comprise* the 
products of at Uast 3 cvcUs of the amplification mathod as claimed 
in one of claims 1 * ? 

13. A kit comprising polynucleotide prima r for amplifying tha 
mlnisatelilte sequence at at laast ona informative locus In a tost 
sample of genomic ONA the primer baing hybridiseble to a single strand 
of the test sample at a region which flanks the miniaateiiite sequence 
of the informative locus to be amplified under conditions such that an 
extension product of tha primer Is synchesised which is complementary 
to and spans the said mlnlsateiiita sequence of the strand of the test 
sample; and tha kit further comprising a control sample of DNA and 
instructions for use. 

14. A kit as claimed In cicim 13 which comprises two complementary 
flanking polynucleotide primers in tespect of each informative locus. 
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